Global patterns of potential future plant diversity hidden in soil seed banks

Yang, Xuejun; Baskin, Carol C.; Baskin, Jerry M.; Pakeman, Robin J.; Huang, Zhenying; Gao, Ruiru; Cornelissen, Johannes H. C.

doi:10.1038/s41467-021-27379-1

Article
Open access
Published: 02 December 2021

Xuejun Yang ORCID: orcid.org/0000-0002-8595-545X¹,
Carol C. Baskin^2,3,
Jerry M. Baskin²,
Robin J. Pakeman⁴,
Zhenying Huang¹,
Ruiru Gao⁵ &
…
Johannes H. C. Cornelissen⁶

Nature Communications volume 12, Article number: 7023 (2021)

Abstract

Soil seed banks represent a critical but hidden stock for potential future plant diversity on Earth. Here we compiled and analyzed a global dataset consisting of 15,698 records of species diversity and density for soil seed banks in natural plant communities worldwide to quantify their environmental determinants and global patterns. Random forest models showed that absolute latitude was an important predictor for diversity of soil seed banks. Further, climate and soil were the major determinants of seed bank diversity, while net primary productivity and soil characteristics were the main predictors of seed bank density. Moreover, global mapping revealed clear spatial patterns for soil seed banks worldwide; for instance, low densities may render currently species-rich low latitude biomes (such as tropical rain-forests) less resilient to major disturbances. Our assessment provides quantitative evidence of how environmental conditions shape the distribution of soil seed banks, which enables a more accurate prediction of the resilience and vulnerabilities of plant communities and biomes under global changes.

Introduction

Soil seed banks are vital for the long-term survival of individual plant species and the diversity and dynamics of plant communities¹. Thereby, they represent a critical but hidden stock for potential future plant diversity on Earth. Seed banks, which include all viable seeds on or in the soil, vary spatially and temporally². Ecological and evolutionary theory recognizes seed banks as ‘biodiversity reservoirs’. Indeed, seed banks support population persistence and biodiversity maintenance through temporal storage effects³ and increasing the gene pool⁴, thereby maintaining a diverse but hidden species pool belowground that hedges against risk of environmental change³. Further, seed banks can affect the potential rate and even direction of evolutionary change because they increase the mean generation times of populations^5,6. Therefore, clarifying the functions of seed banks in community and population dynamics is a key challenge for understanding basic ecological patterns and processes⁷.

Patterns and variation of soil seed banks have long been of much popular interest⁸. Despite the extremely heterogeneous nature of soil seed banks⁹, most studies have been conducted at the local scale, which hampers our general understanding of the assembly processes of this biodiversity reservoir at large scales¹⁰. The few recent studies that have reported on patterns of soil seed banks at macroscales have been conducted in certain regions (e.g., Europe¹¹) or at the global scale for a specific plant group (e.g., invasive species¹²) or ecosystem (e.g., grasslands^10,13).

Further, the very low similarity between soil seed banks and the standing vegetation has been widely recognized^6,9,10; thus, both environmental determinants and responses to global change differ fundamentally between them. Given the predicted impacts of global change on biodiversity, effective management of global diversity requires a complete understanding of the response of plant diversity to environmental changes both aboveground (standing vegetation) and belowground (storage organs, bud banks and soil seed banks). Soil seed bank diversity and density represent much of the resilience of local to biome-scale plant diversity in the face of major disturbances linked to climate or land-use changes. Fully understanding the geographical distribution and environmental determinants of soil seed banks¹⁴, and modeling their role in future plant diversity, requires a global assessment that disentangles the effects of environmental gradients on soil seed bank diversity and density.

Here, we provide such an assessment by compiling and analyzing an extensive database to characterize global determinants and patterns of soil seed banks. Close relationships between soil seed banks and environmental variables (including climate and soil) have been reported, albeit mostly at the local or regional scale^5,9,10. We hypothesized that soil seed bank composition and density should show clear global patterns since environmental conditions vary geographically across the Earth. Biologically, since seed dormancy and longevity in the soil are determined by temperature, precipitation and soil environments¹, and since climate and soil drive plant productivity that in turn should drive seed influx into the soil, we further hypothesized that climate and soil variables are important for predicting soil seed bank diversity and density at the global scale. The main results of our study show that diversity of soil seed banks exhibits clear latitudinal patterns. Climate and soil are the major determinants of seed bank diversity, while net primary productivity and soil characteristics are the main predictors of seed bank density. These results provide insights into environmental determinants of soil seed banks at the global scale.

Results and discussion

Our global database was derived from studies measuring soil seed bank diversity and density of natural plant communities across all continents, albeit with a strong data availability bias towards North America, Europe, eastern Asia and Oceania as compared to elsewhere (Fig. 1). The database contains 15,698 records for soil seed banks worldwide, including 6,480 for diversity (represented here by species richness) and 9,218 for density (number of seeds per soil surface area). The database represents more than a century of research, with the oldest publication dating back to 1918¹⁵. This set of research data on soil seed banks, the most exhaustive and comprehensive to date, allowed us to identify the determinants and patterns of soil seed banks at the global scale.

**Fig. 1: Locations of the soil seed bank studies included in our database.**

To make data among studies comparable, we standardized them using a three-step process. First, we identified soil seed banks that showed seasonal patterns in both diversity and density, all of which peaked slightly in winter (Supplementary Fig. 1a, b). Thus, we standardized all data (from non-(sub-)tropical regions) for other seasons to winter. Second, sampling area for soil seed bank diversity varied among studies, with 0.01 m² being the most commonly reported (Supplementary Fig. 2), to which we standardized all data using a species-area curve (Supplementary Table 1). Third, sampling depth also varied among studies, with 0–5 cm being the most frequently reported soil depth (Supplementary Fig. 3). Therefore, we chose 0–5 cm as the reference depth to which data from other soil depths were standardized. This standardization requires the relationships between seed bank data and soil depth, which we described using the upper and lower limits of each sampled layer (e.g., for 0–5 cm, the upper limit was 0 cm and the lower 5 cm). The log-scale regressions showed that both soil seed bank diversity and density decreased significantly as the upper limit of the sampled layer became deeper but increased as the lower limit became deeper (Supplementary Table 2), and thus we standardized all data to 0–5 cm depth using these relationships. To account for possible variation among biomes, the second and third standardization procedures were conducted for each biome separately. The analyses during standardization confirmed the need to standardize empirical findings when comparing seed bank patterns across studies, as previously stressed in a study on grassland soil seed banks¹⁰. Our standardization procedures made all data comparable in terms of season, sampling area and soil depth.
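
The seasonal step can be illustrated with a short Python sketch. It is not the authors' code (their analyses were run in R), and it assumes that the ratio is the mean winter value divided by the mean value of the sampled season. The function names and the toy numbers are hypothetical.

```python
from statistics import mean

def seasonal_ratios(records, reference="winter"):
    """Return, per season, the ratio mean(reference) / mean(season).

    `records` is a list of (season, value) pairs. This ratio definition is
    an assumption made for illustration.
    """
    by_season = {}
    for season, value in records:
        by_season.setdefault(season, []).append(value)
    ref_mean = mean(by_season[reference])
    return {s: ref_mean / mean(v) for s, v in by_season.items()}

def standardize_to_reference(records, reference="winter"):
    """Multiply every value by its season's ratio to the reference season."""
    ratios = seasonal_ratios(records, reference)
    return [(season, value * ratios[season]) for season, value in records]

# Toy data (invented): seed density in seeds per square metre.
toy = [("winter", 1200.0), ("winter", 800.0),
       ("summer", 500.0), ("summer", 300.0)]
ratios = seasonal_ratios(toy)
standardized = standardize_to_reference(toy)
```

In this toy case the summer ratio is 2.5, so summer records are scaled up to the winter level while winter records are left unchanged.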

Non-parametric Kruskal–Wallis tests showed that soil seed banks differed significantly among ecosystem types. Mangroves, tundra and tropical & subtropical dry broadleaf forests had a lower diversity of soil seed banks, whereas Mediterranean forests, woodlands & scrub, tropical & subtropical moist broadleaf forests and tropical & subtropical coniferous forests had a higher diversity (Supplementary Fig. 4a). For density, mangroves and flooded grasslands & savanna had the lowest value, while temperate broadleaf & mixed forests and temperate conifer forests had the highest value (Supplementary Fig. 4b).
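
For readers unfamiliar with the test, the Kruskal–Wallis statistic can be sketched in a few lines of Python. This is an illustrative version without the tie correction, not the authors' R code. The group values are invented and the function names are hypothetical.

```python
def average_ranks(values):
    """Ranks starting at 1, with tied values sharing their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks

def kruskal_wallis_h(groups):
    """H statistic (no tie correction) for a list of samples."""
    pooled = [v for g in groups for v in g]
    ranks = average_ranks(pooled)
    n_total = len(pooled)
    weighted, start = 0.0, 0
    for g in groups:
        rank_sum = sum(ranks[start:start + len(g)])
        weighted += rank_sum ** 2 / len(g)
        start += len(g)
    return 12 / (n_total * (n_total + 1)) * weighted - 3 * (n_total + 1)

# Toy richness values for three hypothetical biomes.
h = kruskal_wallis_h([[1, 2, 3], [4, 5, 6], [7, 8, 9]])
```

H is compared with a chi-square distribution with (number of groups − 1) degrees of freedom; larger values indicate stronger differences among groups.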

Prior to spatial analyses, we computed semivariograms to determine whether spatial autocorrelation could affect our models. We found that there was no obvious spatial autocorrelation in the data of soil seed bank diversity or density (Supplementary Fig. 5), indicating no spatial dependence in our data. We then used the random-forest algorithm (see Methods for details) to determine the importance (as increase in node purity) of the influence of 31 variables related to climate, soil, human disturbance and spatial coordinates (Supplementary Table 3) on diversity and density of soil seed banks. These variables previously were reported to affect plant performance at the global scale^16,17,18, and thus they could affect soil seed banks via their effects on seed production. Moreover, we expected that these variables could also affect seed longevity in the soil. Full models using all 31 predictors showed that climate and soil were important in predicting soil seed banks (Fig. 2a and Supplementary Fig. 6). Moreover, the spatial coordinate (absolute latitude) was the most important predictor for diversity, i.e., the diversity of soil seed banks exhibits clear spatial patterns at the global scale. Net primary productivity (NPP) and soil characteristics were important in predicting the density of soil seed banks (Fig. 2b and Supplementary Fig. 6).

**Fig. 2: Variable importance (increase in node purity) of random forests run with all 31 predictors.**

We then built final random-forest models using the most important predictors of seed banks selected from full models: nine variables for diversity and five for density (Supplementary Fig. 7). Final models explained more of the total variance than did full models (Supplementary Table 4), and they were robust to K-fold cross-validation (Supplementary Fig. 8), indicating that a small number of variables predicted soil seed bank diversity and density. Absolute latitude (abs.latit) was the most important predictor for diversity, which varied between 0–55° and then decreased beyond this range (Fig. 3a). Five climatic variables were important for diversity. Diversity peaked at intermediate annual temperature ranges (ATR), while it was the lowest at intermediate mean temperature of driest quarter of the year (TDQ), precipitation of the coldest quarter (PCQ) and precipitation of the driest quarter (PDQ). Diversity increased with increasing annual precipitation (AP). In addition, three soil variables were important for diversity. Diversity showed a humped relationship with soil pH, with pH 6–7 having the highest diversity. Diversity increased with soil cation exchange capacity (CEC) and soil silt content (SILT). These results indicate that diversity exhibits strong spatial patterns at the global scale. However, our spatial patterns differ from those found for a specific ecosystem worldwide (e.g., grasslands), where there were only weak latitudinal gradients in seed bank diversity¹⁰. In addition, climate emerged as an important predictor for seed bank diversity, which is consistent with the report that climate acts as environmental filters affecting soil seed bank of grasslands around the world¹³. Our results agree with a continental study in Europe, where ATR was more important than mean annual temperature for determining seed bank richness and warmer temperatures were associated with lower seed bank richness¹¹. 
Possible mechanisms by which temperature affects soil seed banks are that it (1) influences seed bank inputs via its effects on seed production; (2) cues dormancy-breaking and germination¹, thus determining germinable seed output from seed banks; and (3) affects seed metabolic activity and soil fungal activity¹⁹, thereby determining seed viability and persistence in the soil. Finally, our findings of a significant effect of soil pH are supported by some regional and local studies. For instance, seed bank composition is significantly associated with soil pH at high elevations on the Tibetan Plateau²⁰. A negative effect of low pH also has been reported in a large-scale study of acidic and calcareous grasslands in England²¹. Two possible mechanisms for the effects of soil pH are that (1) low pH may cause loss of seed viability due to the toxicity from aluminum or other metals that become more readily available in soils with low pH²²; and (2) high pH may accelerate decomposition and promote growth of pathogens that negatively affect seed persistence²³. In our study, the two mechanisms may operate simultaneously, thereby resulting in the highest diversity of soil seed banks at intermediate pH at the global scale. Further, our results show that soil CEC and SILT affect seed bank diversity, which agrees with a study on the Tibetan Plateau²⁰. The physical and chemical properties of soils can affect seed banks directly by affecting seed germination and aging via regulating soil water-holding capacity²⁴, or indirectly by affecting seed viability via controlling the activity of soil pathogens^21,22,25.

**Fig. 3: Partial feature contributions (the marginal effect of a variable on response) of the most important variables for soil seed banks.**

For soil seed bank density, soil bulk density (BULK) was the most important predictor; density increased below 750 kg/m³ BULK but remained stable when BULK was higher than 800 kg/m³ (Fig. 3b). Density peaked when temperature of the warmest month (TWM) was 34 °C. Density showed similar variation with NPP, precipitation of the driest quarter of the year (PDQ) and of the driest month (PDM), i.e., it peaked at intermediate values of these variables. Precipitation influences the success of sexual reproduction of plants and the size of the seed bank through seed input²⁶, and it also affects soil pathogenic fungi, which cause seed mortality²⁷. Therefore, precipitation has a strong effect on seed bank density, as reported for 27 alpine meadows on the Tibetan Plateau²⁸. Our results further illustrate that PDQ and PDM are the key factors determining seed bank density worldwide, suggesting that moisture fluctuation in soils triggered by precipitation of the driest time of the year can affect seed bank density. If soil moisture fluctuations are high, seed germination will be primed by increasing moisture²⁴.

At the global scale, we mapped soil seed bank diversity and density using the final random-forest models. Mapping soil seed bank values onto global maps revealed considerable geospatial variation, the pattern of which varied between diversity and density (Fig. 4). For diversity, western North America, central South America, central Africa, central Europe, southern and eastern Asia and eastern Oceania had high values. In contrast, eastern and central North America, northern Africa and central Asia had low values (Fig. 4a). For density, northern North America, northern Europe and northern Asia had higher values than elsewhere (Fig. 4b). Our results are consistent with the reports that larger seed banks are more common in cooler temperate climates^19,29. The latitudinal pattern of higher density in colder regions in the Northern Hemisphere may be driven by lower seed mortality in colder soils⁶, resulting in stable seed bank densities of long-lived seeds that counteract low seed production in some years at cold northern latitudes, as shown in a study of temperate forests along a 1900 km latitudinal gradient in northwestern Europe²⁹. The latitudinal pattern highlights that particularly species-rich low-latitude biomes such as tropical rainforests generally have very low seed bank densities, while their seed bank diversity does not exceed that in higher-latitude biomes. However, our global assessment should be interpreted with caution since some studies in azonal vegetation or in rare habitats in our database did not fully reflect soil seed banks in that region, and thus these data shortcomings may have induced bias in our global predictions. Moreover, data gaps in our database are also likely to have had an effect on the global predictions, i.e., fewer data available from some continents (e.g., northern Asia and Africa) could lead to less confidence for prediction in these regions. 
For example, Russia has very few soil seed bank data, which may have led to an inaccurate prediction for this country. Nevertheless, based on our global patterns of soil seed bank diversity and density, the latitudinal pattern strongly suggests that the biodiversity of (sub-)tropical forests is particularly vulnerable to large-scale climatic or land-use disturbances. However, in-depth investigation is needed to quantify the extent to which temporal integration of seed bank effects for long-lived trees and seed masting events may buffer the effects of low seed bank diversity and density at any given time of sampling. In contrast, the higher-latitude plant diversity, while currently low compared to that in tropical rainforest, may rely on high soil seed bank densities to boost its resilience to large-scale climate- or land-use induced disturbances. Further, our analyses suggest that the least vulnerable ecosystems in terms of hidden diversity should be those that combine high seed-bank diversity with high density; and therefore the relationships between the two variables across the global map certainly would be an interesting topic worthy of further study.

**Fig. 4: Extrapolated global maps of soil seed banks.**

Our global assessment reveals that both the diversity and the density of soil seed banks exhibit clear spatial patterns but differ in their environmental determinants. These findings alone do not necessarily mean that this biodiversity reservoir has strong buffering capacity under climate change, because both climate and soil conditions influence seed bank diversity and density. Based on a large number and long history of studies globally, we provide quantitative evidence of how environmental conditions shape soil seed bank distributions and spatially explicit maps of this biodiversity reservoir in plant communities worldwide. Our quantification of environmental determinants and global mapping can be readily applied to dynamic global vegetation and plant diversity models to enable a more complete and accurate prediction of the impact of ongoing environmental changes on plant diversity (both above- and belowground) at the global scale. The next research challenge will be to plot current (visible) aboveground plant diversity (ideally using the available data in the studies themselves) against soil seed bank diversity under global change scenarios in order to pinpoint even more accurately which plant communities, ecosystems and biomes (and their turn-over) are most at risk of losing their diversity due to global changes.

Methods

Global data of soil seed bank

To identify published studies on soil seed banks worldwide, we conducted an ISI Web of Science search covering the time period from 1900 onwards using the following search terms: (“soil seedbank” OR “soil seed bank” OR “soil propagule bank” OR “soil stored seed” OR “buried viable seed”) AND (composition OR richness OR diversity OR “species number” OR density OR abundance). We updated the search several times during the last few years, and the latest update was in May 2021. The total return was 2,166 publications. In addition, we conducted a literature search in the China National Knowledge Infrastructure (CNKI) to identify publications in Chinese. The abstract of each publication was read individually to assess suitability of the study before obtaining the publication, and the reference list of each publication collected was inspected to identify additional relevant publications. Finally, we pre-selected a total of 1,774 publications on soil seed banks worldwide (1,472 in English and 302 in Chinese).

To avoid bias in publication selection, only those studies were selected that met all of the following criteria. (1) Samples were collected from natural vegetation, and the results reported at least one data point on diversity and/or density of the soil seed bank. Old-fields abandoned for longer than five years were considered because they resemble natural vegetation, while weed/crop experiments were not included because agricultural seed banks reflect cultivation and cropping patterns and thus any environmental control is secondary. (2) Studies were included only when diversity or density were measured at the whole community level (i.e., all species in a community). (3) Only studies conducted in terrestrial ecosystems were included. In total, 1,502 publications met the above criteria (Supplementary Data 1 and 2).

For studies that included different levels of natural gradients (e.g., different ecosystems, soil depth, sampling time or topographic and moisture gradients), data for these levels were considered as independent. If environmental conditions were manipulated in a study (e.g., herbivory, nutrients, warming or CO₂), we extracted only data from the treatment that most closely reflected the situation under natural conditions. In addition, we excluded review/synthetic papers and used only studies that reported primary field data. We extracted data from the text, tables, digitized graphs and supplementary materials.

Statistical analyses

All statistical analyses were performed with the open-source language R (version 3.4.3, https://cran.r-project.org/).

Sampling time, area and soil depth differed both within and among studies, which might induce biases when comparing data. To account for such biases, we did three things. First, we divided the data into different seasons according to sampling time (i.e., spring, summer, autumn and winter). Then, we standardized all data to the season with the highest value (winter) by calculating an average ratio between that season and winter and then multiplying by that ratio. In this way, we standardized all data to the time with highest value, which made all data comparable across different sampling times. Because tropical regions have low seasonality, data collected from (sub)tropical biomes were not standardized using this procedure. Second, we used a species-area curve (Eq. (1)) to account for the difference between sampling area for diversity⁶:

$$S = C A^{Z}, \qquad (1)$$

where S is the number of species (diversity), C a fitted constant, A the sampling area and Z a fitted constant. We used pooled diversity data to estimate the parameters and standardized all diversity to the most commonly reported area (0.01 m²). To minimize the bias caused by extreme values, the outliers in the data were identified by Rosner’s test using the EnvStats package³⁰, and outliers above the upper limit were capped with the value of the 95th percentile. Notably, the species–area relationship could have considerable geographical variation due to biomes³¹; thus, we modeled the species-area curve for each biome separately. For this, we extracted the biome type of each data point from the Terrestrial Ecoregions of the World (TEOW)³². Most studies reported density as number of seeds per m² of soil surface. Otherwise, we used sample area to extrapolate data to number of seeds per m². Third, we used linear regression models to determine relationships between seed bank data and upper and lower boundaries of sampling soil depths (slices), and estimated parameters were used to standardize data to the most commonly reported soil depth (0–5 cm), which made the data comparable. To account for the differences among biomes, we modeled these relationships for each biome separately. Further, to determine whether there was a potential artifact of sampling bias, we compared seed bank diversity and density for each biome between the Southern and Northern Hemispheres (Supplementary Table 5). Of the 9 comparisons for diversity, only 4 pairs are significantly different, among which 3 pairs actually have a higher value in the Southern Hemisphere. For density, mean values were also not biased towards the Southern or Northern Hemisphere. These results clearly indicate that our global predictions of the higher soil seed banks in the Northern Hemisphere (Fig. 4) are unlikely to reflect an artifact of sampling bias between the Northern and Southern Hemispheres. 
We then used non-parametric Kruskal–Wallis tests to compare the differences in soil seed banks among biome types.
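
The area standardization based on Eq. (1) can be sketched as follows in Python. This is a minimal illustration, not the authors' R code: it assumes that C and Z are estimated by ordinary least squares on the log-log form, log S = log C + Z log A (the source does not state the fitting method), and the function names and toy data are hypothetical.

```python
import math

def fit_species_area(areas, richness):
    """Least-squares fit of log S = log C + Z log A; returns (C, Z)."""
    xs = [math.log(a) for a in areas]
    ys = [math.log(s) for s in richness]
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    z = sxy / sxx
    c = math.exp(y_bar - z * x_bar)
    return c, z

def standardize_richness(s, area, z, target_area=0.01):
    """Rescale richness from `area` to `target_area`, with S proportional to A^Z."""
    return s * (target_area / area) ** z

# Toy data generated exactly from S = 20 * A^0.25 (invented values).
areas = [0.01, 0.04, 0.16, 1.0]
richness = [20 * a ** 0.25 for a in areas]
c_hat, z_hat = fit_species_area(areas, richness)
s_std = standardize_richness(richness[3], areas[3], z_hat)
```

Because C cancels in the ratio of two richness values, only Z is needed to rescale a record from its sampled area to 0.01 m². In a per-biome analysis the fit would be repeated for each biome's records.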

Spatial autocorrelation in primary data can lead to overoptimistic assessment of model predictive power^33,34. To account for this issue, we computed semivariograms to determine spatial autocorrelation patterns in our data prior to spatial analyses. To identify the key factors that determine the pattern of soil seed banks worldwide, we selected 31 global predictors previously reported to affect plant performance^16,17,18: 19 climatic indices, 8 top soil variables, 1 human footprint (a composite variable compiled on eight variables measuring the direct and indirect human pressures on the environment globally³⁵), 2 plant indices and 1 spatial coordinate (see Table S1 for sources of the predictors).
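
An empirical semivariogram of the kind mentioned above can be sketched in Python as follows. This is an illustrative version, not the authors' R code: it uses planar Euclidean distances (real site coordinates would call for great-circle distances), and the function name, bin settings and toy data are hypothetical.

```python
import math
from itertools import combinations

def empirical_semivariogram(points, values, bin_width, n_bins):
    """Return a list of (bin centre, semivariance, pair count) tuples.

    gamma(h) = sum((z_i - z_j)^2) / (2 * N(h)) over the N(h) pairs whose
    separation falls in the distance bin. Distances are planar here.
    """
    sums = [0.0] * n_bins
    counts = [0] * n_bins
    for (i, p), (j, q) in combinations(enumerate(points), 2):
        k = int(math.dist(p, q) // bin_width)
        if k < n_bins:
            sums[k] += (values[i] - values[j]) ** 2
            counts[k] += 1
    return [((k + 0.5) * bin_width, sums[k] / (2 * counts[k]), counts[k])
            for k in range(n_bins) if counts[k] > 0]

# Toy transect (invented): values rise steadily along the x axis.
points = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0), (3.0, 0.0)]
values = [1.0, 2.0, 3.0, 4.0]
result = empirical_semivariogram(points, values, bin_width=1.0, n_bins=4)
```

In this toy transect the semivariance grows with distance, which is the signature of spatial dependence. A roughly flat semivariogram, as the authors report for their data, suggests little spatial autocorrelation.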

We implemented the random-forest algorithm to model the relationships between these predictors and soil seed banks. The random-forest model is a data-driven ensemble learning approach that averages over multiple regression trees, each of which uses a random subset of all the model variables to predict a response³⁶. Random forest handles highly collinear predictors by spreading the importance of a variable across all correlated variables³⁷. This approach runs efficiently on large databases and has been successfully applied to global analyses^17,18. We first determined the influence of all 31 predictors on soil seed banks. Variable importance was ranked in terms of the increase in node purity, which is the decrease in the residual sum of squares that results from splitting regression trees using the variable. We also reported the percentage increase in mean squared error (MSE), which quantifies the increase in model error that results from randomly shuffling the values of a given predictor. The random-forest algorithm was carried out using the R package randomForest³⁸. The full models (using all 31 predictors) were run using 100 regression trees each.
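
The idea behind the permutation-based importance measure can be sketched in Python. This is a simplified illustration, not the randomForest implementation: it shuffles each predictor on one fixed dataset and reports the absolute (not percentage) increase in MSE, whereas the R package works on out-of-bag samples per tree. The stand-in model, function names and data are hypothetical.

```python
import random

def mse(model, rows, y):
    """Mean squared error of `model` (a callable on one row) over the data."""
    return sum((model(r) - t) ** 2 for r, t in zip(rows, y)) / len(y)

def permutation_importance(model, rows, y, n_repeats=20, seed=0):
    """Mean increase in MSE after shuffling each predictor column in turn."""
    rng = random.Random(seed)
    baseline = mse(model, rows, y)
    importances = []
    for j in range(len(rows[0])):
        increases = []
        for _ in range(n_repeats):
            column = [r[j] for r in rows]
            rng.shuffle(column)
            # Rebuild the rows with only column j permuted.
            shuffled = [r[:j] + [v] + r[j + 1:] for r, v in zip(rows, column)]
            increases.append(mse(model, shuffled, y) - baseline)
        importances.append(sum(increases) / n_repeats)
    return importances

# Toy setup (invented): the response depends on column 0 only.
data_rng = random.Random(1)
rows = [[data_rng.uniform(0, 10), data_rng.uniform(0, 10)] for _ in range(200)]
y = [3.0 * r[0] for r in rows]

def model(row):
    """Stands in for a fitted forest; it ignores column 1 by construction."""
    return 3.0 * row[0]

importance = permutation_importance(model, rows, y)
```

Shuffling the informative column inflates the error, while shuffling the ignored column leaves it unchanged, so the first importance is positive and the second is zero.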

We then implemented a variable selection procedure using the R package VSURF³⁹, which used the random forests permutation-based score of importance and proceeded using a stepwise forward strategy for variable introduction. Specifically, a variable was added only if the decrease in error was larger than a threshold, i.e., the decrease in out-of-bag (OOB) error had to be significantly greater than the average variation obtained by adding noisy variables. The most important predictor variables for seed bank diversity and density were selected to build final models (Supplementary Fig. 7). We ran the random-forest algorithm using the final models. We found that final models explained higher variance than full models (Supplementary Table 4). We plotted the final variable response of soil seed bank to each of the most important predictors using the R package forestFloor⁴⁰.

To test the sensitivity of final model performance, we performed K-fold cross-validations that test the sensitivity of model predictions to the exclusion of random subsets from the training data. Cross-validation was implemented using the R package rfUtilities⁴¹. We ran 99 iterations that withheld 10% of the model training data. These tests showed that our training data had sufficient redundancy to ensure that our model conclusions were robust.
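
The resampling scheme (99 iterations, each withholding 10% of the training data) can be sketched in Python as repeated random hold-out validation. This is not the rfUtilities procedure: a simple least-squares line stands in for the random-forest model so that the example stays self-contained, and all names and data are hypothetical.

```python
import random

def fit_line(xs, ys):
    """Ordinary least-squares slope and intercept for one predictor."""
    n = len(xs)
    x_bar, y_bar = sum(xs) / n, sum(ys) / n
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, y_bar - slope * x_bar

def repeated_holdout(xs, ys, n_iter=99, withhold=0.10, seed=0):
    """Refit on the retained data n_iter times; return hold-out RMSE per run."""
    rng = random.Random(seed)
    n = len(xs)
    n_test = max(1, round(n * withhold))
    errors = []
    for _ in range(n_iter):
        test_idx = set(rng.sample(range(n), n_test))
        train = [(x, y) for i, (x, y) in enumerate(zip(xs, ys))
                 if i not in test_idx]
        slope, intercept = fit_line(*zip(*train))
        squared = [(ys[i] - (intercept + slope * xs[i])) ** 2 for i in test_idx]
        errors.append((sum(squared) / n_test) ** 0.5)
    return errors

# Toy data (invented): a linear signal with unit-variance Gaussian noise.
noise = random.Random(42)
xs = [float(i) for i in range(100)]
ys = [2.0 * x + 1.0 + noise.gauss(0, 1) for x in xs]
errors = repeated_holdout(xs, ys)
```

If the hold-out errors stay close to one another across iterations, the fitted relationship does not hinge on any particular subset of the data, which is the sense of robustness described above.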

Finally, we derived global predictions of diversity and density of soil seed banks at a spatial resolution of 5 arcmin × 5 arcmin grid cells. We made predictions based on the final random-forest models, using the same predictor variables for the global grid.
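
To give a sense of scale, the following Python sketch lays out a global latitude-longitude grid at this resolution. It is an illustration only: the row and column conventions and the function names are assumptions, and it counts all cells, including ocean cells for which no prediction would be made.

```python
ARCMIN_PER_DEGREE = 60

def grid_shape(resolution_arcmin=5):
    """Number of (rows, columns) in a global latitude-longitude grid.

    Assumes the resolution divides 60 arcmin evenly.
    """
    cells_per_degree = ARCMIN_PER_DEGREE // resolution_arcmin
    return 180 * cells_per_degree, 360 * cells_per_degree

def cell_centre(row, col, resolution_arcmin=5):
    """Latitude and longitude (degrees) of a cell centre; row 0 is northernmost."""
    step = resolution_arcmin / ARCMIN_PER_DEGREE
    lat = 90 - (row + 0.5) * step
    lon = -180 + (col + 0.5) * step
    return lat, lon

n_rows, n_cols = grid_shape()
lat, lon = cell_centre(0, 0)
abs_latitude = abs(lat)  # the spatial predictor used in the diversity model
```

At 5 arcmin there are 12 cells per degree, giving a 2,160 × 4,320 grid. Each land cell would be paired with its predictor values and passed to the final model.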

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

Data and references from which data were collected supporting the findings of this study are available in the Supplementary Data 1 and 2. Terrestrial Ecoregions of the World (TEOW) are publicly available on the World Wildlife Fund (WWF) website [https://www.worldwildlife.org/]. Climate data reported in this study are publicly available on the WorldClim database [https://www.worldclim.org/]. Soil data are publicly available on the SoilGrids system [https://soilgrids.org/].

Code availability

The R codes used for analyses are available in the Supplementary Software.

References

1. Baskin, C. C. & Baskin, J. M. Seeds Ecology, Biogeography, and Evolution of Dormancy and Germination. 2nd edn (Academic Press/Elsevier, 2014).
2. Simpson, R. L., Leck, M. A. & Parker, V. T. Seed banks: General concepts and methodological issues. In Ecology of Soil Seed Banks (eds Leck, M. A., Parker, V. T. & Simpson, R. L.) 3–8 (Academic Press, 1989).
3. Chesson, P. & Huntly, N. The roles of harsh and fluctuating conditions in the dynamics of ecological communities. Am. Nat. 150, 519–553 (1997).
4. Honnay, O., Bossuyt, B., Jacquemyn, H., Shimono, A. & Uchiyama, K. Can a seed bank maintain the genetic variation in the above ground plant population? Oikos 117, 1–5 (2008).
5. Evans, M. E. K. & Dennehy, J. J. Germ banking: bet-hedging and variable release from egg and seed dormancy. Q. Rev. Biol. 80, 431–451 (2005).
6. Vandvik, V., Klanderud, K., Meineri, E., Måren, I. E. & Töpper, J. Seed banks are biodiversity reservoirs: species–area relationships above versus below ground. Oikos 125, 218–228 (2016).
7. Alexander, H. M. et al. Metapopulations and metacommunities: combining spatial and temporal perspectives in plant ecology. J. Ecol. 100, 88–103 (2012).
8. Duvel, J. W. T. Seeds buried in the soil. Science 17, 872–873 (1903).
9. Moore, P. D. Soil seed banks. Nature 284, 123–124 (1980).
10. Jabot, F. & Pottier, J. Macroecology of seed banks: The role of biogeography, environmental stochasticity and sampling. Glob. Ecol. Biogeogr. 26, 1247–1257 (2017).
11. Plue, J. et al. Buffering effects of soil seed banks on plant community composition in response to land use and climate. Glob. Ecol. Biogeogr. 30, 128–139 (2021).
12. Gioria, M., Le Roux, J. J., Hirsch, H., Moravcová, L. & Pyšek, P. Characteristics of the soil seed bank of invasive and non-invasive plants in their native and alien distribution range. Biol. Invas. 21, 2313–2332 (2019).
13. Kiss, R., Deák, B., Török, P., Tóthmérész, B. & Valkó, O. Grassland seed bank and community resilience in a changing climate. Restor. Ecol. 26, S141–S150 (2018).
14. Walck, J. L., Hidayati, S. N., Dixon, K. W., Thompson, K. & Poschlod, P. Climate change and plant regeneration from seed. Glob. Change Biol. 17, 2145–2161 (2011).
15. Brenchley, W. E. Buried weed seeds. J. Agric. Sci. 9, 1–31 (1918).
16. Kreft, H. & Jetz, W. Global patterns and determinants of vascular plant diversity. Proc. Natl Acad. Sci. USA 104, 5925–5930 (2007).
17. Liang, J. et al. Positive biodiversity-productivity relationship predominant in global forests. Science 354, aaf8957 (2016).
18. Steidinger, B. S. et al. Climatic controls of decomposition drive the global biogeography of forest-tree symbioses. Nature 569, 404–408 (2019).
19. Pakeman, R. J., Cummins, R. P., Miller, G. R. & Roy, D. B. Potential climatic control of seed bank density. Seed Sci. Res. 9, 101–110 (1999).
20. Ma, M., Dalling, J. W., Ma, Z. & Zhou, X. Soil environmental factors drive seed density across vegetation types on the Tibetan Plateau. Plant Soil 419, 349–361 (2017).
21. Basto, S., Thompson, K. & Rees, M. The effect of soil pH on persistence of seeds of grassland species in soil. Plant Ecol. 216, 1163–1175 (2015).
22. Pakeman, R. J., Small, J. L. & Torvell, L. Edaphic factors influence the longevity of seeds in the soil. Plant Ecol. 213, 57–65 (2012).
23. Bekker, R. M., Knevel, I. C., Tallowin, J. B. R., Troost, E. M. L. & Bakker, J. P. Soil nutrient input effects on seed longevity: a burial experiment with fen-meadow species. Funct. Ecol. 12, 673–682 (1998).
24. Long, R. L. et al. The ecophysiology of seed persistence: a mechanistic view of the journey to germination or demise. Biol. Rev. 90, 31–59 (2015).
25. Blaney, C. S. & Kotanen, P. M. Effects of fungal pathogens on seeds of native and exotic plants: a test using congeneric pairs. J. Appl. Ecol. 38, 1104–1113 (2001).
Article Google Scholar
Ooi, M. K. J. Seed bank persistence and climate change. Seed Sci. Res. 22, S53–S60 (2012).
Article Google Scholar
Beckstead, J., Meyer, S. E., Connolly, B. M., Huck, M. B. & Street, L. E. Cheatgrass facilitates spillover of a seed bank pathogen onto native grass species. J. Ecol. 98, 168–177 (2010).
Article Google Scholar
An, H., Zhao, Y. & Ma, M. Precipitation controls seed bank size and its role in alpine meadow community regeneration with increasing altitude. Glob. Change Biol. 26, 5767–5777 (2020).
Article ADS Google Scholar
Plue, J. et al. Climate-controlled seed bank patterns. Glob. Ecol. Biogeogr. 22, 1106–1117 (2013).
Article Google Scholar
Millard, S. P EnvStats: An R Package for Environmental Statistics (Springer, 2013).
Gerstner, K., Dormann, C. F., Václavík, T., Kreft, H. & Seppelt, R. Accounting for geographical variation in species–area relationships improves the prediction of plant species richness at the global scale. J. Biogeogr. 41, 261–273 (2014).
Article Google Scholar
Olson, D. M. et al. Terrestrial ecoregions of the world: A new map of life on Earth. BioScience 51, 933–938 (2001).
Article Google Scholar
Ploton, P. et al. Spatial validation reveals poor predictive performance of large-scale ecological mapping models. Nat. Commun. 11, 4540 (2020).
Article ADS CAS Google Scholar
van den Hoogen et al. A geospatial mapping pipeline for ecologists. Preprint at bioRxiv https://doi.org/10.1101/2021.07.07.451145 (2021).
Venter, O. et al. Global terrestrial Human Footprint maps for 1993 and 2009. Sci. Data 3, 160067 (2016).
Article Google Scholar
Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).
Article Google Scholar
Cutler, D. R. et al. Random forests for classification in ecology. Ecology 88, 2783–2792 (2007).
Article Google Scholar
Liaw, A. & Wiener, M. Classification and regression by randomForest. R News 2, 18–22 (2002).
Google Scholar
Genuer, R., Poggi, J.-M. & Tuleau-Malot, C. VSURF: An R package for variable selection using random forests. R J. 7/2, 19–33 (2015).
Article Google Scholar
Welling, S. H., Refsgaard, H. H. F., Brockhoff, P. B. & Clemmensen, L. K. Forest floor visualizations of random forests. Preprint at https://arxiv.org/abs/1605.09196 (2016).
Evans, J. S. & Murphy, M. A. rfUtilities. R package version 2.1–3 https://cran.r-project.org/package=rfUtilities (2018).

Download references

Acknowledgements

We thank Prof. Ken Thompson from the University of Sheffield, UK, for critical comments that improved the manuscript. We thank W. Zhang, Y. Xu, C. Di, W. Ji, M. Dong, W. Ren, Y. Yang, T. Shao, and M. Wu from the School of Life Sciences, Shanxi Normal University, for assistance in collecting part of the data. This work was supported by the National Natural Science Foundation of China (32071524 and 31770514 to X.Y., and 31861143024 to Z.H.). International research travel by J.H.C.C. was partly funded by the Royal Netherlands Academy of Arts and Sciences (KNAW, CEP grant 12CDP007).

Author information

Authors and Affiliations

State Key Laboratory of Vegetation and Environmental Change, Institute of Botany, Chinese Academy of Sciences, Beijing, China
Xuejun Yang & Zhenying Huang
Department of Biology, University of Kentucky, Lexington, KY, USA
Carol C. Baskin & Jerry M. Baskin
Department of Plant and Soil Sciences, University of Kentucky, Lexington, KY, USA
Carol C. Baskin
Department of Ecological Sciences, The James Hutton Institute, Craigiebuckler, Aberdeen, AB15 8QH, UK
Robin J. Pakeman
The School of Life Sciences, Shanxi Normal University, Linfen, Shanxi, China
Ruiru Gao
Systems Ecology, Department of Ecological Science, VU University, De Boelelaan 1085, 1081 HV, Amsterdam, The Netherlands
Johannes H. C. Cornelissen

Contributions

X.Y., C.C.B., J.M.B., Z.H., and J.H.C.C. conceived the study. X.Y. and R.G. collected the data. X.Y. performed the analyses. The manuscript was drafted by X.Y., with contributions from C.C.B., J.M.B., R.J.P., and J.H.C.C.

Corresponding author

Correspondence to Zhenying Huang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks Orsolya Valkó, Amanda Taylor, and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Dataset 1

Dataset 2

Supplementary Software

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Yang, X., Baskin, C.C., Baskin, J.M. et al. Global patterns of potential future plant diversity hidden in soil seed banks. Nat Commun 12, 7023 (2021). https://doi.org/10.1038/s41467-021-27379-1

Received: 14 August 2021
Accepted: 16 November 2021
Published: 02 December 2021
DOI: https://doi.org/10.1038/s41467-021-27379-1
