Abstract
Global patterns of regional (gamma) plant diversity are relatively well known, but whether these patterns hold for local communities, and the dependence on spatial grain, remain controversial. Using data on 170,272 georeferenced local plant assemblages, we created global maps of alpha diversity (local species richness) for vascular plants at three different spatial grains, for forests and non-forests. We show that alpha diversity is consistently high across grains in some regions (for example, Andean-Amazonian foothills), but regional ‘scaling anomalies’ (deviations from the positive correlation) exist elsewhere, particularly in Eurasian temperate forests with disproportionally higher fine-grained richness and many African tropical forests with disproportionally higher coarse-grained richness. The influence of different climatic, topographic and biogeographical variables on alpha diversity also varies across grains. Our multi-grain maps return a nuanced understanding of vascular plant biodiversity patterns that complements classic maps of biodiversity hotspots and will improve predictions of global change effects on biodiversity.
Introduction
Our understanding of the global patterns of plant diversity largely stems from studies based on either local to national floras or stacked distribution range maps1,2,3,4. These approaches allow quantification of the total number of species occurring in a region but do not address how plant species co-occur locally and form species-rich or species-poor communities. With the notable exceptions of trees and ferns5,6,7,8,9, the global distribution of local plant diversity remains poorly understood10.
The species richness of local plant communities, i.e., alpha diversity, is non-linearly related to the size of the sampling unit, i.e., the spatial grain9,11,12,13. Enlarging the sampling unit means that more species are progressively captured in the same plot, so that the alpha diversity of a sampled plot slowly, but non-linearly, approaches the regional species richness, i.e. gamma diversity11,12,14. The steepness of the curve, i.e., beta diversity, determines how the plant community composition varies from place to place11. This non-linearity complicates direct comparisons of biodiversity data from place to place and makes mapping alpha diversity across large areas challenging. Even in well-sampled regions, available data are heterogeneous mixtures of surveys with varying spatial grains and sampling protocols, and different reference taxonomies9,15,16. Furthermore, there is a typical trade-off between spatial grain and extent in biodiversity research, with most fine-grained studies only covering limited spatial extents. Thus the question of whether global patterns of alpha diversity are consistent with known patterns of regional gamma diversity has remained unanswered.
Plant diversity patterns result from ecological and evolutionary processes acting at different spatial and temporal scales17,18. At continental and regional scales, evolutionary processes (migration, speciation, extinction) as well as geological and climatic history play key roles19,20. At local scales, diversity depends primarily on assembly processes related to species dispersal, habitat filtering and biotic interactions (including humans)3,21,22. There is clearly an intimate nested relationship between processes at different scales, as a species must be present regionally to occur locally, and large-scale environmental factors influence local conditions8,17,23. An exploration of alpha diversity patterns at multiple grain sizes can discriminate between areas where species richness is consistently high or low across grain sizes, and those where it is not, i.e., where species richness is either high at fine grains and low at coarse grains, or vice versa9,10,24,25. This may provide insights into the prevailing mechanisms that shape biodiversity distribution at different scales, and which produce and maintain global plant diversity11,26. For example, the discrepancies between alpha diversity patterns at different grains could indicate regional or biome-related variation in the roles of habitat heterogeneity, dispersal barriers or environmental filtering27,28,29.
Here, we explore alpha diversity patterns across multiple spatial grains globally. We leverage methodological advances in modeling biodiversity across scales9,12,18,30,31,32 using the sPlot database, a global initiative that aggregates and harmonizes local-scale species co-occurrence data from hundreds of independent datasets and vegetation surveys15,16. The sPlot database incorporates more than 1 million vegetation plots and covers both natural and semi-natural ecosystems on all continents and in all biomes15. We focused on terrestrial vascular plants only, since data on bryophytes, lichens, vascular epiphytes and aquatic habitats are too scattered in the sPlot database.
We applied machine learning (boosted regression trees) to model the relationships between vascular plant species richness at different grains and 20 global datasets on current and past climate, soil and topography. Our models allowed relationships between alpha diversity and environmental variables to vary across grains by including interaction terms between plot size and other predictors9. To simultaneously quantify uncertainty and to account for the uneven distribution of data across biomes and vegetation formations in our database, we averaged our results over 99 model runs, each based on a stratified resampling of the data (Supplementary Fig. 1, Supplementary Data 1). By modeling the relationships between alpha diversity and environmental variables across the globe, we (1) predicted alpha diversity of vascular plants at three different grain sizes spanning two orders of magnitude, (2) determined how the explanatory power of potential environmental drivers on alpha diversity varies across the three grain sizes, and (3) identified regional scaling anomalies, i.e., areas where alpha diversity is high at fine grain but low at coarse grains, or vice versa.
Results
Multi-grain global maps of local species richness
We modeled forest and non-forest ecosystems jointly but focus on each broad formation separately in the main text. Modeling them separately yielded similar results (not shown). For forests, we generated estimates for the three grain sizes most commonly used for sampling forests: 400 m2, 1000 m2 and 1 ha. At the finest grain (400 m2), the estimated alpha diversity of vascular plants (median prediction of each pixel of 2.5 arcminute resolution across the 99 resampled subsets) ranged from 1 to 120 species (median across all pixels = 22, interquartile range or IQR = 10; Fig. 1A, Supplementary Table 1). The areas with alpha diversity above the global 95th percentile (hereafter ‘hotspots’) were the forest-steppe region of easternmost Europe and Siberia, East Asia, Borneo and New Guinea, the eastern coast of Australia, the western Congo Basin, eastern Madagascar, the Andean-Amazonian foothills, the South American Atlantic Forest (‘Mata Atlântica’) and the Appalachian Mountains. Coldspots (i.e., areas with alpha diversity at a given grain size below the global 5th percentile) occurred in the Atlantic and Mediterranean part of Europe, central and western India, southern Australia, central Africa – specifically the eastern Guinean forest and the Sudanian savanna belt – and along the Pacific coast of North America. At the intermediate grain (1000 m2), the median estimated richness per grid cell in forest ecosystems ranged from 1 to 197 vascular plant species (global median across all grid cells = 29, IQR = 13) (Fig. 1B, Supplementary Table 1). Compared to the finest grain, all the hotspots in the equatorial region (Indonesia, Borneo, Andean-Amazonian foothills) increased in extent, whereas hotspots in the temperate and boreal regions either disappeared or shrank considerably. The coldspots in Western and Southern Europe and western North America remained, while those in central Africa diminished in size. Finally, at the coarsest grain (1 ha), average species richness per grid cell ranged from 2 to 921 species (median = 40, IQR = 39; Fig. 1C, Supplementary Table 1). At this grain, the well-known difference in species richness between the tropics and the boreal and temperate regions became apparent. The South American hotspots became connected, forming a belt spanning from the Andean-Amazonian foothills through the Chiquitano dry forest to the southern Pantanal and the Mata Atlântica regions. The hotspot in the western Congo Basin increased in size (Fig. 1C). The temperate region contained no hotspots at this grain. The coldspot in southern Australia expanded to the eastern coast, while the coldspot in central Africa disappeared. The uncertainty in alpha diversity estimates, quantified as the ratio between IQR and median across the 99 resampled subsets, was highest in the boreal regions of Canada, Central and Eastern Siberia, the Amazon and Sundaland (Supplementary Fig. 2).
Spatial grains: A 400 m2; B 1000 m2; C 1 ha. The maps on the left show the median estimated species richness at the corresponding spatial grain for each 2.5 arcminute grid cell of the World, averaged over 99 boosted regression tree models based on different resampled datasets. Colors are on a log2 scale. The maps on the right show the distribution of hotspots (red) and coldspots (blue), i.e., areas where species richness is above the 95th or below the 5th global percentile, respectively. We only show alpha diversity estimates for locations where forests would grow under current climate conditions and without human influence208. Hatching represents data-poor regions, i.e., regions farther than 500 km from any vegetation plots, for which we did not generate predictions. Global maps with predictions for these data-poor regions can be found in Supplementary Fig. 3. Values are averaged over 2600 km2 hexagons. Source data are provided as a Source Data file.
For non-forest ecosystems, we used an alternative set of grains: 10 m2, 100 m2 and 1000 m2, to match the most frequently used plot sizes in our database. At the finest grain (10 m2), the median estimated alpha diversity across the 99 resampled subsets ranged from 0 to 68 vascular plant species (median across all grid cells = 14, IQR = 7; Fig. 2A, Supplementary Table 1). At this grain, non-forest hotspots were widely distributed across the forest-steppe region of easternmost Europe and Siberia, the central loess plateau of China, southern Eastern Australia, the Drakensberg region in South Africa, subtropical South America and eastern North America. Coldspots were widespread in southern Central Asia, central and northwestern Australia, the Sahel region of Africa and along the Pacific coast of South America. At the intermediate grain (100 m2), the median estimated species richness per grid cell ranged from 0 to 90 (median = 17, IQR = 9, Fig. 2B, Supplementary Table 1), and the distribution of hotspots and coldspots remained essentially unchanged compared to the finest grain. At the coarsest grain (1000 m2), the median estimated richness per grid cell ranged from 0 to 184 species (median = 23, IQR = 13, Fig. 2C, Supplementary Table 1). Except for the Loess plateau in China, hotspots were almost exclusively concentrated in subtropical regions at this scale, especially southeastern Australia, Madagascar, the Appalachian region, and the Pantanal and southern Cerrado in South America. The location of coldspots hardly changed compared to finer grains. The uncertainty in alpha diversity estimates was highest in northern Canada, the Tibetan Plateau and the Persian Gulf region (Supplementary Fig. 2). A map jointly showing alpha diversity of forest and non-forest ecosystems at 1000 m2 grain is available in the supplementary material (Supplementary Fig. 4).
Spatial grains: A 10 m2; B 100 m2; C 1000 m2. The maps on the left show the median estimated species richness at the corresponding spatial grain for each 2.5 arcminute grid cell of the World, averaged across 99 boosted regression tree models based on different resampled datasets. Colors are on a log2 scale. The maps on the right show the distribution of hotspots (red) and coldspots (blue), i.e., areas where species richness is above the 95th or below the 5th global percentile, respectively. We only show alpha diversity estimates for locations where the land cover ‘herbaceous vegetation’ occurs based on a consensus map that integrates multiple global remote sensing-derived land-cover products209. Hatching represents data-poor regions, i.e., regions farther than 500 km from any vegetation plots, for which we did not generate predictions. Global maps with predictions for these data-poor regions can be found in Supplementary Fig. 5. Values are averaged over 2600 km2 hexagons. Source data are provided as a Source Data file.
Overall, the models showed a relatively high predictive power (average over 99 resampling iterations: Pearson’s r = 0.49), even after implementing a spatially constrained, block cross-validation33 that accounted for the residual non-independence of training and test datasets arising from the clustered nature of our database34. We found no major bias or trend in residuals across grain sizes, biomes or geographical regions (Supplementary Fig. 6), and the frequency distributions of observed and predicted values largely overlapped (Supplementary Fig. 7, Supplementary Table 2). The predicted values showed a slight tendency towards the sample mean with thinner tails at the extremes, which is a common feature of ensemble machine-learning methods, even with the bias-correction method we used (see Methods)35. Minor deviations only occurred for the dry mid-latitude and boreal biomes at coarse grains (Supplementary Fig. 7). Given the relatively small sample size for the wet tropics, we recommend interpreting the results for these regions with caution. For a complete description of model validation, see Supplementary Methods.
Environmental and biogeographical determinants
Our statistical models reveal which of the environmental and biogeographic variables tested appear to drive alpha diversity of vascular plants (Fig. 3). Among the predictors having a higher-than-expected relative influence, plot size, i.e., the grain size of the vegetation plot, consistently ranked first across the 99 resampled models. Climate also had a high relative influence in shaping alpha diversity patterns, especially annual mean temperature and the temperature of the warmest and wettest quarter of the year (PC1 and PC4, respectively, in a principal component analysis based on 18 bioclimatic variables). The ecoregional species pool, i.e., the estimated number of species occurring in the ecoregion in which a given plot is located2, was the fourth most important predictor, highlighting the nested link between local and regional biodiversity. Finally, despite the expected importance of soil conditions for local plant diversity, only one soil variable, i.e., the percentage of coarse soil fragments, had an influence greater than 5%.
Points represent the median relative importance of a predictor across 99 runs of a boosted regression tree model that jointly models vascular plant species richness in forest and non-forest formations. The bars connect the 2.5th and the 97.5th percentiles of the relative importance distribution across runs. The vertical dashed line separates variables with relative influence higher or lower than expected, i.e., those variables whose relative influence is higher or lower than 100% divided by the number of variables (n = 20).
We created partial dependence plots to explore the directionality of these relationships and whether they are consistent across spatial grains and vegetation formations (Fig. 4). Plant alpha diversity increased non-linearly with increasing plot size. This effect saturated at relatively fine grains (~100 m2) in non-forest ecosystems and at 1 ha in forest ecosystems, which can be explained by the different grains at which forests and non-forests were sampled, and the different spatial structure of these vegetation types. Grain size interacted with most of the other predictors, as revealed by the different environment–richness relationships at different grains (Fig. 4). Alpha diversity increased when the size of the ecoregional species pool increased, but only for coarse grains. It also increased toward tropical regions (i.e., regions with higher temperatures of the warmest and wettest quarters, high scores on PC4) and at higher mean annual temperature (PC1), especially for coarse grains.
These plots show the fitted function of the most influential variables explaining vascular plant alpha diversity at three different spatial grains, while holding all other predictor variables constant at their mean value. The fitted function is the difference between the response value at a given value of each predictor and the mean response value. Each line represents the fitted function for one of the 99 boosted regression tree model runs. Fine, intermediate and coarse grains correspond to 400 m2, 1000 m2 and 1 ha in forests (A) and 10, 100, and 1000 m2 in non-forest ecosystems (B), respectively. Variables are sorted by decreasing relative influence. The rug plots on the x-axis display the distribution of the calibration data. Note the different range of plot sizes between forest and non-forest ecosystems.
Regional scaling anomalies in species richness across grain sizes
Many areas with relatively high fine-grained alpha diversity also had high alpha diversity at coarser grains (Fig. 5). For forests, our models revealed consistently high alpha diversity across grains in Sundaland, the Congo Basin, Madagascar, as well as in the eastern Andean foothills, the Amazon Basin and the Southern American Mata Atlântica (Fig. 5A). Areas with consistently low alpha diversity across all grains were the western parts of the USA and Canada, the Atlantic region of Europe, Fennoscandia, the Mediterranean Basin, central and northern India, and southern Australia. However, not all areas with relatively high fine-grained richness also had high coarse-grained richness, and vice versa, revealing regional scaling anomalies in plant alpha diversity patterns18. Areas with high plant alpha diversity at coarse grains, but relatively low alpha diversity at fine grains, were the tropical forests of Africa and the Guiana Shield in South America. The opposite was true in the Eastern European forest–steppe belt, northeastern Argentina, Eastern Australia and New Zealand (Fig. 5A).
Correspondence between estimates of plant alpha diversity at fine and coarse grains for A forest and B non-forest ecosystems. Fine-grained alpha diversity was calculated at 400 m2 and 10 m2 for forest and non-forest ecosystems, respectively. Coarse-grained alpha diversity was calculated at 1 ha and 1000 m2 for forest and non-forest ecosystems, respectively. We only show alpha diversity estimates where (A) forests would grow under current climate conditions and without human influence208, or B the land cover ‘herbaceous vegetation’ occurs, based on a consensus map integrating multiple global remote sensing-derived land-cover products209. Color codes are based on quartile distributions of species richness at the two grains. Parallel hatching represents data-poor regions, i.e., regions farther than 500 km from any vegetation plots. Values are averaged over 7700 km2 hexagons. SR: species richness. Source data are provided as a Source Data file.
The regions hosting non-forest ecosystems with consistently high plant alpha diversity across grains were the European Alps, the forest-steppe of Eastern Europe and Siberia, the loess plateau of China, Eastern Australia, eastern South Africa, Madagascar, the Chaco, Mata Atlântica and some other regions of South America, and eastern North America (Fig. 5B). Consistently low plant alpha diversity across grains occurred in Inner Asia and in the northern African desert and semi-desert regions, the Tibetan Plateau, Namib Desert, central Australia, the Atacama and High Monte deserts in the high Andean plateaus south of the equator as well as in the North American prairies and deserts. High coarse-grain species richness was associated with low fine-grain species richness in the Myanmar-Thailand-China borderland, Ethiopia and Mexico. The opposite situation was relatively rare, occurring locally in the temperate grasslands of southeastern Australia.
Discussion
By simultaneously highlighting patterns at multiple spatial grains, our maps provide a nuanced picture of the pattern of alpha diversity of vascular plants. This complements our understanding of the distribution of biodiversity hotspots36 and regional (i.e., gamma) vascular plant diversity2,3,4,37. Within the broad range of plot sizes commonly used for vegetation sampling, our maps distinguish between regions where high coarse-grained alpha diversity results largely from high fine-grained richness, and regions where high coarse-grained alpha diversity results more from species turnover between adjacent plant communities (i.e., fine-grained beta diversity).
Our results are consistent with previous studies suggesting that forests in Borneo, New Guinea, Madagascar, eastern South Africa and the Andean-Amazonian foothills are hotspots for plant biodiversity across all spatial grains37. There is considerable agreement between our map of 1-ha alpha diversity in forests and a recently published global map of tree species richness at the same grain9. Similarly, patterns of fine-grained alpha diversity in non-forest ecosystems are consistent with the local and regional patterns recently observed for alpine vegetation38 and Palearctic grasslands25. We also found good agreement with previous research in the distribution of areas of low diversity (coldspots), such as the non-forest vegetation in the western Tibetan Plateau, the semi-desert regions of central Asia, coastal Somalia and the forests in the Pacific Northwest of North America, despite the large difference in grain37.
In some regions, however, the difference between our results and previously reported patterns was striking. None of the regions holding the world records of plant alpha diversity appeared in our results39. The foothills of the Carpathians, for instance, are known for hosting semi-natural grasslands that are among the most species-rich plant communities globally at fine grains (e.g., >100 species in 16 m2)39,40. As many as 233 species (including 59 epiphyte species, not considered here) were observed in a 100 m2 rain forest plot in Costa Rica41. At intermediate grains, very high plant species richness has been reported for the hemiboreal forests of the northern Russian Altai (149 species per 1000 m2)42 and Colombia (313 species per 1000 m2)43. At coarse grains, the world record is in Ecuador (942 species in 1 ha, including 172 epiphytes)44. Except for the Altai region, however, our maps do not show record high species richness in any of these regions. A general explanation is that our maps represent local averages across model runs, large areas (2.5 arcminute grid resolution) and a mixture of habitat types, so that the richest sites, which are rare in the landscape, have been averaged with neighboring sites that belong to other ecosystems with lower species richness. This is true, for instance, in Europe, where our data contained most non-forest vegetation types, including species-poor grasslands on acidic soils. The lack of data for epiphytes can partially explain why our model did not predict the expected high alpha diversity in Mesoamerica, where this growth form can account for up to 25% of forest species41,45,46.
Interestingly, our models highlighted that alpha diversity does not differ markedly between temperate and tropical regions at the finest grains, but differences become more pronounced at coarser grains. This may reflect the often overlooked fact that tropical forests have a relatively species-poor herb layer compared to temperate forest ecosystems46,47. For instance, the high alpha diversity of trees in West African forests7 is not accompanied by an equally high richness of herb or shrub species in the understorey. The low diversity in these understories could be due to the fact that tropical lowland forests have a closed canopy year-round48, or that fires occur frequently, favoring grass-dominated, species-poor understories49. Together with the scarcity of data on epiphytes, a species-poor herb layer might explain why tropical lowland forests exhibit scaling anomalies, namely low alpha diversity at fine grains but high at coarse grains. If most of the diversity (or data) is in the tree layer, large vegetation plots are needed to ensure that the diversity of an ecosystem is appropriately sampled, as few tree individuals can physically co-occur at small sampling grains. We note, however, that uncertainties were high for tropical forests, requiring a cautious interpretation of these results.
In general, finding these scaling anomalies points to the role of beta diversity as a cross-scale diversity metric, and suggests that the relative contribution of different eco-evolutionary processes in determining plant diversity patterns varies between regions. In many tropical lowland forests, alpha diversity is low at fine grains but increases rapidly with increasing grain size. This is the case, for instance, in the western Amazon, where much of the regional (gamma) diversity depends on species turnover rather than on the coexistence of a high number of species at the same site50. This suggests that the tropics might be shaped by processes promoting species coexistence through a tighter packing in the niche space. Recent work found a latitudinal increase in niche specialization and marginality of trees towards the equators, which has been attributed to the stable climate and high productivity in the tropics51. Alternative explanations include rarity and priority effects related to high productivity29, more uniform environmental conditions and stronger dispersal limitation at fine scales28, or stronger mycorrhiza-mediated effects of interspecific competition and habitat adaptation52 in the tropics compared to temperate regions. While the relative contribution of these processes remains a matter of speculation, our work points to the need for an improved understanding of the spatial variation of beta diversity in plant diversity analysis53. Beta diversity, rather than alpha diversity per se, appears to be the main driver of spatial differences in gamma diversity between temperate and tropical regions.
Conversely, we observed high plant alpha diversity at fine grains but relatively low alpha diversity at coarse grains in many temperate regions, including the Eastern European forest-steppe belt, East Asia and southeastern Australia. This pattern might be indicative of effective niche partitioning at fine grains and more homogeneous landscapes without dispersal barriers at coarse grains54. There is evidence that niche processes play a stronger role than neutral processes in determining fine-scale beta diversity at higher latitudes and altitudes28,29,30, where species are thought to have broader niches and be less responsive to geographical changes55. This is consistent with recent findings that the nestedness of tree communities increases with latitude, possibly due to the high share of ectomycorrhizal species in colder and wetter conditions52. Finally, high species richness at fine grains might also depend on plant size, as many small plants can coexist in a given grain size. Such conditions mainly occur in grasslands, e.g., in Eastern Australia, where this mechanism has been invoked to explain differences in beta diversity among vegetation types56.
Our work allows us to rank the predictors of alpha diversity by their importance. Since the species–area relationship has often been described as one of the few rules in ecology14, the high importance of plot size in our models is not surprising. Our important advance, however, is that by explicitly incorporating this nonlinear relationship into our models, we created a grain-independent model that links alpha diversity to multiple climatic, topographic and biogeographical predictors. We also showed that ecoregions with a large species pool are more likely to host species-rich communities. This pattern became disproportionately stronger at coarser grains, probably because at finer grains the maximum number of locally co-occurring species is constrained by the number of individuals that can fit into the grain. The other biogeographical covariates, namely biomes and realms, had very little effect on predicting alpha diversity. This is probably because they are closely related to other predictors with stronger explanatory power, i.e., macroclimate and ecoregions, respectively3. The increasing influence of macroclimate and ecoregional species pool with grain size is, however, in line with evidence on the role of climatic and geological histories of ecoregions on species pools8,10,20,24. This is not surprising since tectonic movements, uplift of mountain ranges, climatic stability, and glaciation events all play a role in driving regional speciation and extinction rates3. This result supports the view that, although intimately related, habitat filtering and biogeographical factors related to regional differences in geological and climatic history, have a different influence on patterns of alpha diversity at fine vs. coarse grains10.
Although our study is based on the largest collection of global vegetation-plot data ever compiled, there are some shortcomings. The most important limitation is the uneven distribution of vegetation plots across biogeographical regions. Most of our data points were in Europe and other countries with a strong tradition of vegetation surveys, while the coverage of tropical areas, especially the Amazon and equatorial Africa, was poor (Supplementary Fig. 1). Furthermore, data from tropical forests were often incomplete, containing information on woody species only. Although the targeted search for additional data, coupled with the stratified resampling and statistical model we applied, mitigate these problems (see Methods), they clearly cannot compensate for the lack of comprehensive data on plant composition in many species-rich regions (especially large parts of the tropics). Ongoing initiatives to mobilize existing data, expand biodiversity surveys by including underreported growth forms such as herbs or epiphytes45,46, and improve the overall taxonomic knowledge for these regions47,57 are, therefore, high priorities in biodiversity research58. A second limitation is the scale mismatch between some very fine-grained vegetation plots and our use of coarse-grained environmental predictors, as highlighted by other global-scale biogeographical analyses22. Thus, our models ignore the mounting evidence of the strong modulating impact of local land cover, topographic heterogeneity and vegetation structure on climatic conditions, rendering the environmental conditions experienced by organisms at the local scale markedly different from those inferred from global macroclimatic models59. Finally, our analysis focuses on natural and semi-natural plant communities but ignores the role of human impacts and non-native species invasions. These effects are too diverse and multifaceted to be included in a simple statistical model but clearly play a major role in the distribution of plant species, both at local and regional scales60. Taken together, these limitations imply that although the accuracy of our models was relatively high, our results may still be missing important environmental drivers, especially at fine grain sizes.
Despite these limitations, our analysis provides important insights and is a step forward in mapping global plant diversity. First, it reinforces the idea that large-scale evolutionary and historical processes interact with local factors to shape plant communities3,17,23. Indeed, our models indicate that macroecological gradients have a consistent effect on plant alpha diversity, but with magnitudes that vary across grains. Second, by highlighting regional scaling anomalies in alpha diversity across different plot sizes, our study can improve our ability to predict biodiversity response to global change11. Third, our work adds a new dimension to our understanding of global biodiversity patterns and hotspots previously defined based on gamma diversity only. This could have implications for conservation. For example, coarse-grained hotspots might require networks of relatively large protected areas, whereas fine-grained hotspots might be more sensitive to biotic homogenization and more dependent on maintaining traditional management or a particular type of land use. Explicit consideration of the difference between coarse- and fine-grained hotspots complements the regional data on species richness and endemism commonly used for delineating global biodiversity hotspots.
Methods
Species richness data
The vegetation-plot database ‘sPlot’ (www.idiv.de/splot) collates 110 national or regional vegetation-plot datasets. Vegetation-plot records provide geo-referenced information on the presence and cover/abundance of all vascular plants co-occurring within a delimited area. The sPlot database version 2.1 contains records from 1,121,244 vegetation plots surveyed between 1885 and 2015. These comprise 23,586,216 occurrence records for 58,066 vascular plant taxa, whose names have been standardized to a common nomenclature15. When the formation to which a plot belonged was not specified (n = 137,146 plots), we used the growth form of the recorded species61 to classify a plot as forest or non-forest as in ref. 22. That is, we defined a plot record as forest if the sum of the cover values of all tree taxa was >25% of the sum of the cover values of all species in that plot, and as non-forest, if the sum of cover values of all low‐growing taxa other than trees and shrubs was >90% of the sum of the cover values of all species in that plot. Plots not meeting either condition were excluded from the analysis, as well as all plots belonging to wetland or aquatic vegetation. Plots also had a wide variation in the sampled area (1–25,000 m2). Therefore, we performed a preliminary screening and only retained plots sized between 100 and 25,000 m2 for forest, and between 10 and 1500 m2 for non-forest, as these are the most frequent plot sizes used by plant ecologists in the field. Plots without information on the sampled area were also excluded. Similarly, we excluded all plots that we could confidently assign to anthropogenic communities, here defined as any vegetation that is shaped by intensive and repeated human interference, including weed communities on arable land, ruderal vegetation and intensively managed pastures and meadows.
The data in the sPlot database are geographically biased since plots are unevenly distributed across geographical regions and formations (Supplementary Fig. 1), with relatively few data from the wet tropics. We therefore made a special effort to improve the data coverage in these regions by searching for publications and databases that report species richness, plot size and spatial coordinates of vegetation plots in the tropics. We focused on plots for which the full assemblage of vascular plants (with or without epiphytes) was sampled. However, such data were particularly scarce in many regions (e.g., the central Amazon, Western Ghats and Sundaland). For these regions, we also included data reporting woody species richness only (along with the diameter at breast height—DBH—used as the minimum sampling threshold). In total, we found information for an additional set of 1914 vegetation plots from 53 papers (Supplementary References). Of these, only 170 vegetation plots contained species richness information for all vascular plants. Finally, we scanned the Global Index of Vegetation-plot Databases62 to retrieve additional datasets from the tropics, which were not included in sPlot 2.1. We obtained permission to use 11 local datasets, totaling 7929 additional vegetation plots (7385 with species richness data for all vascular plants). In total, our database contained 412,452 vegetation plots41,59,63,64,65,66,67,68,69,70,71,72,73,74,75,76,77,78,79,80,81,82,83,84,85,86,87,88,89,90,91,92,93,94,95,96,97,98,99,100,101,102,103,104,105,106,107,108,109,110,111,112,113,114,115,116,117,118,119,120,121,122,123,124,125,126,127,128,129,130,131,132,133,134,135,136,137,138,139,140,141,142,143,144,145,146,147,148,149,150,151,152,153,154,155,156,157,158,159,160,161,162,163,164,165,166,167,168,169,170,171,172,173,174,175,176,177,178,179,180,181,182,183,184,185,186,187,188,189,190,191,192,193,194,195,196 (Supplementary Fig. 1, Supplementary Data 1).
Data cleaning and geographical resampling
To further mitigate the remaining geographical bias in vegetation-plot distribution and to account for the fact that plot sizes vary markedly across regions and vegetation types (Supplementary Fig. 8), we applied a stratified resampling strategy that we repeated 99 times. We defined each stratum as a unique combination of realm197, biome15, broad formation (two classes: forest and non-forest), and plot size as a factor variable with four levels (small: ≤150, medium: 150–600, large: 600–1200, very large: >1200 m2). These intervals were chosen to encompass the grains used for predictions (i.e., 10, 100, 400, 1000 and 1 ha, see below) while accounting for the fact that some plot sizes are more routinely used than others. For each stratum, we randomly sampled (without replacement) up to 100 vegetation plots in each iteration. If a stratum had fewer than 100 vegetation plots, we retained all of them. This procedure resulted in the selection of 17,972 plots in each iteration. The total number of plots used across the 99 iterations was 170,272. Altogether, these plots provided 9,953,940 occurrence records for 53,271 vascular plant taxa, i.e., ~15% of the estimated ~350,000 vascular plant species that exist. This figure is slightly underestimated, since for 1893 plots (59,299 occurrence records) only aggregated alpha diversity data were available, but no species-level data.
Not all vegetation plots were complete with respect to the sampled functional groups. Most records from tropical forest plots contained either only tree data, or only data on trees and shrubs (Supplementary Fig. 9). Excluding these plots would not be optimal, as it would have greatly reduced the spatial coverage of our dataset. Since most of these incompletely sampled plots were from the tropics, excluding them would also create the risk of introducing a strong spatial bias into our model. Therefore, we retained these plots in the dataset and included a new predictor variable called ‘plants recorded’ (three levels: ‘complete vegetation’, ‘trees and shrubs only’, ‘trees only’) in our statistical models (see below). Specifically, a plot belonged to the ‘only trees’ level if it only contained information on woody species with a diameter at breast height (DBH) larger than 5 cm. It belonged to the ‘only trees and shrubs’ level when it either contained information on all woody species (both trees and shrubs) but not herbs, or if the minimum DBH threshold used for sampling woody individuals was less than or equal to 5 cm.
As most of these incompletely sampled plots were in the tropics, we simulated the occurrence of incomplete plots also in the other biomes when resampling the full database. This was achieved by selecting some plots with complete vegetation information and recalculating their species richness when accounting for ‘only trees’, i.e., discarding all information on the occurrence of shrub and herb species, or for ‘only trees and shrubs’, i.e., discarding information on herbs. We limited this procedure to biomes with >10,000 plots with complete vegetation information (i.e., subtropics with winter rain, subtropics with year-round rain, temperate mid-latitudes). In these biomes, 20% of all the plots selected randomly within each resampling iteration (623 on average) were transformed this way. This corresponded to an increase in the number of incomplete plots in these selected biomes from 151 to 359 (on average over the 99 iterations), which is close to the average number of incomplete plots occurring in the other biomes (n = 373). By rarefying data to simulate plots with incomplete vegetation records, we reduced the possible geographical bias resulting from the uneven distribution of incomplete plots across biomes. This allowed the use of incomplete plots from tropical regions (where complete plots are rare) when modeling the response of local vascular plant richness at the global scale (see below).
Explanatory variables
Based on the plots’ geographic coordinates, we retrieved bioclimatic, soil, topographic and biogeographical variables from external sources, which we used as explanatory variables for species richness modeling. We extracted all the 19 bioclimatic variables included in CHELSA v1.1198, and seven soil variables at 250-m resolution from the SOILGRIDS project199. The soil variables were: (1) clay mass fraction (%); (2) silt mass fraction (%); (3) sand mass fraction (%); (4) coarse fragment fraction (%); (5) soil organic carbon content (g/kg); (6) soil pH (measured in water); and (7) cation exchange capacity. After standardizing and centering all 26 variables, we performed two principal component analyses (PCA), one for climate and one for soil. For subsequent analyses, we used the first five principal components for climate and the first four for soil, because these components accounted for more than 90% of the total variation in these ordinations. We interpreted these principal components based on the respective loadings of the corresponding environmental variables. For climate, the predictors with the highest loadings were: mean annual temperature for PC1; mean annual precipitation and mean diurnal temperature range for PC2; precipitation seasonality and precipitation of the wettest quarter for PC3; temperature of the wettest and temperature of the warmest quarter for PC4; and precipitation of the coldest quarter for PC5 (Supplementary Table 3, Supplementary Fig. 10). For soils, PC1 was mainly explained by soil bulk density; PC2 by sand content; PC3 by the percentage of coarse fragments and PC4 by soil pH (Supplementary Table 4, Supplementary Fig. 11).
To account for topographic heterogeneity, we also extracted data on plot topography from the EarthEnv.org data portal200. Specifically, we used terrain ruggedness (TRI, calculated at 50 km resolution), dominant landform (10 types at 1 km resolution: flat, peak, ridge, shoulder, spur, slope, hollow, footslope, valley, pit), and the number of landforms within a 50 km radius around each plot.
To account for historical and biogeographical factors, we included two predictors of the velocity of climate change between the Last Glacial Maximum and the present (one for temperature, one for precipitation) derived from ref. 201. These layers measure the local rate of displacement of climatic conditions and integrate macroclimatic shifts with local spatial topoclimatic gradients. Additionally, we considered two nominal biogeographical variables, realm197 and biome15, which we considered as rough proxies of the different geologic, biogeographical and climatic histories of different regions. The biomes were derived from Schultz’s ecozones202, which we modified to distinguish alpine areas203. Thus, our biomes are not nested within realms. As another surrogate for the biogeographical imprinting on alpha diversity patterns, we also accounted for regional effects by including the estimated size of the regional species pool for each of the 867 terrestrial ecoregions of the world2.
We then considered three additional predictors: a binary variable distinguishing two broad formations (i.e., forest: True\False), a nominal predictor accounting for the different functional groups sampled in each plot (i.e., ‘complete vegetation’, ‘only trees and shrubs’ and ‘only trees’, see above), and plot size, i.e., the spatial grain used in vegetation sampling.
In total, we considered 20 predictors: five principal components summarizing climate, four principal components summarizing soils, three variables quantifying topographic heterogeneity, five related to biogeographical history, one representing vegetation formation and two related to sampling design. Multicollinearity among predictors was limited, as no pair of predictors had Pearson’s r coefficient greater than 0.64 (Supplementary Fig. 12).
Statistical modeling
We used boosted regression trees (BRTs) to model the relationships between species richness and the explanatory variables. BRTs are nonparametric machine-learning models based on decision trees in a boosting framework. BRTs have few prior assumptions, are relatively robust against overfitting, missing data, and collinearity, and are very flexible in detecting nonlinear relationships and interactions among predictors204. We parameterized our BRTs as follows. We first set a tree complexity of 5 and a bag fraction of 0.5. We then systematically tested the combination between learning rates (from 0.00025 to 0.1) and the number of trees returning the highest 10-fold cross-validated model fit, using the gbm.step routine from the dismo package205. For each explanatory variable, we calculated its relative influence (i.e., the fraction of times a variable was selected for splitting a tree in each BRT model, weighted by the squared model improvement) across the 99 resampled sets. To visualize the relationship between species richness and the explanatory variables, we created partial dependence plots at selected grain sizes to visualize the marginal effect of a given predictor on the response variable. We considered an explanatory variable as relevant in the model if its relative influence (averaged over 99 resamplings) was greater than 5%, which is the expected share if all the 20 predictors had the same relative importance.
BRTs are unbiased on average, i.e., the sum of the residuals is close to zero. Yet, similarly to other ensemble machine-learning methods, they produce results that are biased in a different sense: small values are often overestimated and large values underestimated35. This happens because the final prediction is the unweighted average of a collection of regression trees, which inevitably leads to results biased towards the sample mean. To avoid this problem, we implemented a bias-correction algorithm called ROE: regression of observed on estimated values35,206. In the first step, we fitted a linear regression of the observed values on the fitted values:
where Sfit is the vector of species richness predicted by a BRT in a given iteration, and Sobs is the vector of observed species richness in that iteration. We then created a vector of bias-corrected, fitted species richness \({S}_{{{{{{{\rm{fit}}}}}}}}^{{{{{{{\rm{bc}}}}}}}}\) as:
thus, introducing the constraint that \({S}_{{{{{{{\rm{fit}}}}}}}}^{{{{{{{\rm{bc}}}}}}}}\) is no smaller than zero206.
We then used the above BRT models, together with the regression parameters a and b, to make bias-corrected predictions of local vascular plant richness at different plot sizes for all terrestrial pixels of the globe at 2.5 arcminute resolution. We did this separately for forest and non-forest ecosystems. For each pixel, we extracted the value for all 17 spatially explicit predictors (climate, soil, topography and biogeography) based on the pixel location. The variable ‘forest’ was set to ‘True’ for creating forest maps and ‘False’ for non-forest maps. For each of the 99 resampling iterations, we created multiple predictions, one for each selected sampling grain (i.e., 400 m2, 1000 m2 and 1 ha for forests, and 10, 100 and 1000 m2 for non-forests). In all cases, we only predicted species richness for the complete vegetation (i.e., including trees, shrubs and herbs). We also mapped the variability of our predictions, as the interquartile range (IQR - i.e., the difference between the 75th and 25th percentiles) across the 99 resampling iterations. Finally, we created a map of ignorance207 showing the geographic distance from the nearest vegetation plot used to calibrate our models (Supplementary Fig. 13). The map of ignorance highlights the uncertainty due to the uneven geographic distribution of vegetation plots and shows areas with limited or no data where our estimation should be taken with caution. Based on this map, we highlighted all data-poor regions, i.e., regions located farther than 500 km from the nearest plot, by parallel hatching in our maps. Given the strong structural differences between forests and non-forest ecosystems, we presented the multi-grain maps of plant richness separately for these two broad formations in the main text. Nevertheless, we also produced a joint map at 1000 m2 grain by complementing species richness estimates for forests with non-forest species richness for pixels outside the forest mask. For forests, we predicted all pixels where forests would grow under current climate conditions and without human influence208. For non-forest, we extracted all pixels where the land cover class ‘herbaceous vegetation’ occurs based on a consensus map integrating land-cover products derived from remote sensing209.
Model validation
We assessed model performance in three ways. First, we averaged the tenfold cross‐validation across resampled sets obtained from the BRT output. Second, for each of the 99 resampled sets, we selected all plots not used in the specific set and calculated Pearson’s correlation between species richness observed in a given plot and the respective BRT prediction at a grain corresponding to the plot area. As a third approach, we performed a spatially-constrained cross-validation210. We did this because our plots were spatially clustered and our spatial predictors had high spatial autocorrelation (2320 km on average across all the quantitative predictors, based on 5000 random samples, Supplementary Fig. 14). This means that even selecting plots completely independent of the training dataset does not ensure proper validation of our models, as the training and the test data remain spatially dependent. This violation of the fundamental assumption of model validation, namely the independence between training and test data, has been shown to affect many mapping models created with ‘Big Data’ approaches34. To avoid this problem, we divided the world into square spatial blocks whose size corresponds to the average spatial autocorrelation range of the quantitative predictors (i.e., 2320 km, n = 84, Supplementary Fig. 14). For each resampling, we randomly assigned each block to five folds using the function spatialBlock in the R package blockCV33, which selects the most even spread of vegetation-plot data across folds in 99 iterations (Supplementary Fig. 15). We then refitted our BRT model five times for each resampling, each time using four out of five folds for training and the remaining fold for validation, and averaged Pearson’s correlation coefficient between the observed and predicted species richness across folds. We also repeated this process separately for each biome separately, i.e., sequentially withholding all data located within a fold and a given biome for validation. We then reported the distribution of these correlation coefficients across the resampled sets, both when considering all plots, and when disaggregating by biomes. Finally, we checked the model residuals for spatial autocorrelation by fitting variogram models to the residuals using the function variogram from the R package gstat211. All analyses were performed in R 3.6.3212. Map boundaries derive from R package rnaturalearth213.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Data availability
Source data are provided with this paper. All species richness data necessary to reproduce the results of this manuscript, including those retrieved through the literature search, and all raster files (format: GeoTiff) used to create the multi-grain maps of species richness are available at: https://doi.org/10.25829/idiv.3506-p4c0mo (ref. 214).
The vegetation-plot raw data contained in the sPlot database are available upon request by submitting a project proposal to sPlot’s Steering Committee. The proposals should follow the Governance and Data Property Rules of the sPlot Working Group available on the sPlot website (www.idiv.de/splot). Source data are provided with this paper.
Code availability
The code for reproducing the analyses presented in this article is available at: https://zenodo.org/badge/latestdoi/433417900 (ref. 215).
References
Linder, H. P. Plant diversity and endemism in sub‐Saharan tropical Africa. J. Biogeogr. 28, 169–182 (2001).
Kier, G. et al. Global patterns of plant diversity and floristic knowledge. J. Biogeogr. 32, 1107–1116 (2005).
Kreft, H. & Jetz, W. Global patterns and determinants of vascular plant diversity. Proc. Nat. Acad. Sci. 104, 5925–5930 (2007).
Brummitt, N., Araújo, A. C. & Harris, T. Areas of plant diversity—What do we know? Plants, People, Planet 3, 33–44 (2020).
Gentry, A. H. Changes in plant community diversity and floristic composition on environmental and geographical gradients. Ann. Mo. Bot. Gard. 75, 1–34 (1988).
Slik, J. F. et al. An estimate of the number of tropical tree species. Proc. Natl Acad. Sci. 112, 7472–7477 (2015).
Parmentier, I. et al. The odd man out? Might climate explain the lower tree α‐diversity of African rain forests relative to Amazonian rain forests? J. Ecol. 95, 1058–1071 (2007).
Weigand, A. et al. Global fern and lycophyte richness explained: How regional and local factors shape plot richness. J. Biogeogr. 47, 59–71 (2020).
Keil, P. & Chase, J. M. Global patterns and drivers of tree diversity integrated across a continuum of spatial grains. Nat. Ecol. Evol. 3, 390–399 (2019).
Lenoir, J. et al. Cross-scale analysis of the region effect on vascular plant species diversity in southern and northern European mountain ranges. PLoS ONE 5, e15734 (2010).
Chase, J. M. et al. Species richness change across spatial scales. Oikos 128, 1079–1091 (2019).
Bruelheide, H., Jiménez-Alfaro, B., Jandt, U. & Sabatini, F. M. Deriving site-specific species pools from large databases. Ecography 43, 1215–1228 (2020).
Dengler, J. et al. Species–area relationships in continuous vegetation: Evidence from Palaearctic grasslands. J. Biogeogr. 47, 72–86 (2020).
Whittaker, R. J. & Fernández-Palacios, J. M. Island Biogeography: Ecology, Evolution, And Conservation (Oxford University Press, 2007).
Bruelheide, H. et al. sPlot —a new tool for global vegetation analyses. J. Veg. Sci. 30, 161–186 (2019).
Sabatini, F. M. et al. sPlotOpen—an environmentally balanced, open-access, global dataset of vegetation plots. Glob. Ecol. Biogeogr. 30, 1740–1764 (2021).
Ricklefs, R. E. Community diversity—relative roles of local and regional processes. Science 235, 167–171 (1987).
Crawley, M. J. & Harral, J. E. Scale dependence in plant biodiversity. Science 291, 864–868 (2001).
Antonelli, A. et al. An engine for global plant diversity: highest evolutionary turnover and emigration in the American tropics. Front. Genet. 6, 130 (2015).
Jiménez-Alfaro, B. et al. History and environment shape species pools and community diversity in European beech forests. Nat. Ecol. Evol. 2, 483–490 (2018).
Sabatini, F. M., Jiménez-Alfaro, B., Burrascano, S. & Blasi, C. Drivers of herb-layer species diversity in two unmanaged temperate forests in northern Spain. Community Ecol. 15, 147–157 (2014).
Bruelheide, H. et al. Global trait–environment relationships of plant communities. Nat. Ecol. Evol. 2, 1906–1917 (2018).
Pärtel, M., Bennett, J. A. & Zobel, M. Macroecology of biodiversity: disentangling local and regional effects. N. Phytol. 211, 404–410 (2016).
Field, R. et al. Spatial species‐richness gradients across scales: a meta‐analysis. J. Biogeogr. 36, 132–147 (2009).
Biurrun, I. et al. Benchmarking plant diversity of Palaearctic grasslands and other open habitats. J. Veg. Sci. 32, e13050 (2021).
Da, S. S. et al. Plant biodiversity patterns along a climatic gradient and across protected areas in West Africa. Afr. J. Ecol. 56, 641–652 (2018).
Gerstner, K., Dormann, C. F., Václavík, T., Kreft, H. & Seppelt, R. Accounting for geographical variation in species–area relationships improves the prediction of plant species richness at the global scale. J. Biogeogr. 41, 261–273 (2014).
Myers, J. A. et al. Beta-diversity in temperate and tropical forests reflects dissimilar mechanisms of community assembly. Ecol. Lett. 16, 151–157 (2013).
Muñoz Mazón, M. et al. Mechanisms of community assembly explaining beta-diversity patterns across biogeographic regions. J. Veg. Sci. 32, e13032 (2021).
Sabatini, F. M., Jiménez-Alfaro, B., Burrascano, S., Lora, A. & Chytrý, M. Beta-diversity of central European forests decreases along an elevational gradient due to the variation in local community assembly processes. Ecography 41, 1038–1048 (2018).
Večeřa, M. et al. Alpha diversity of vascular plants in European forests. J. Biogeogr. 46, 1919–1935 (2019).
Wüest, R. O. et al. Macroecology in the age of Big Data—Where to go from here? J. Biogeogr. 47, 1–12 (2019).
Valavi, R., Elith, J., Lahoz-Monfort, J. J. & Guillera-Arroita, G. blockCV: an r package for generating spatially or environmentally separated folds for k-fold cross-validation of species distribution models. Methods Ecol. Evol. 10, 225–232 (2019).
Ploton, P. et al. Spatial validation reveals poor predictive performance of large-scale ecological mapping models. Nat. Commun. 11, 4540 (2020).
Belitz, K. & Stackelberg, P. Evaluation of six methods for correcting bias in estimates from ensemble tree machine learning regression models. Environ. Model. Softw. 139, 105006 (2021).
Myers, N., Mittermeier, R. A., Mittermeier, C. G., Da Fonseca, G. A. & Kent, J. Biodiversity hotspots for conservation priorities. Nature 403, 853 (2000).
Barthlott, W., Mutke, J., Rafiqpoor, D., Kier, G. & Kreft, H. Global centers of vascular plant diversity. Nova Acta Leopoldina NF 92, 61–83 (2005).
Testolin, R. et al. Global patterns and drivers of alpine plant species richness. Glob. Ecol. Biogeogr. 30, 1218–1231 (2021).
Wilson, J. B., Peet, R. K., Dengler, J. & Pärtel, M. Plant species richness: the world records. J. Veg. Sci. 23, 796–802 (2012).
Chytrý, M. et al. The most species-rich plant communities in the Czech Republic and Slovakia (with new world records). Preslia 87, 217–278 (2015).
Whitmore, T. C., Peralta, R. & Brown, K. Total species count in a Costa Rican tropical rain forest. J. Trop. Ecol. 1, 375–378 (1985).
Chytrý, M. et al. High species richness in hemiboreal forests of the northern Russian Altai, southern Siberia. J. Veg. Sci. 23, 605–616 (2012).
Duivenvoorden, J. Vascular plant species counts in the rain forests of the middle Caquetá area, Colombian Amazonia. Biodivers. Conserv. 3, 685–715 (1994).
Balslev, H., Valencia, R., Paz y Miño, G., Christensen, H. & Nielsen, I. in Forest Biodiversity in North, Central and South America and the Carribean: Research and Monitoring. Man and the Biosphere Series (eds. Dallmeier, F. & Comiskey, J. A.) (Unesco and The Parthenon Publishing Group, 1998).
Mendieta‐Leiva, G. et al. EpIG‐DB: a database of vascular epiphyte assemblages in the Neotropics. J. Veg. Sci. 31, 518–528 (2020).
Spicer, M. E., Mellor, H. & Carson, W. P. Seeing beyond the trees: a comparison of tropical and temperate plant growth forms and their vertical distribution. Ecology 101, e02974 (2020).
Royo, A. A. & Carson, W. P. The herb community of a tropical forest in central Panama: dynamics and impact of mammalian herbivores. Oecologia 145, 66–75 (2005).
Sosef, M. S. M. et al. Exploring the floristic diversity of tropical Africa. BMC Biol. 15, 15 (2017).
Dwomoh, F. K. & Wimberly, M. C. Fire regimes and forest resilience: alternative vegetation states in the West African tropics. Landsc. Ecol. 32, 1849–1865 (2017).
Condit, R. et al. Beta-diversity in tropical forest trees. Science 295, 666–669 (2002).
Cao, K. et al. Species packing and the latitudinal gradient in beta-diversity. Proc. R. Soc. B 288, 20203045 (2021).
Zhong, Y. et al. Arbuscular mycorrhizal trees influence the latitudinal beta-diversity gradient of tree communities in forests worldwide. Nat. Commun. 12, 3137 (2021).
Graco-Roza, C. et al. Distance decay 2.0—a global synthesis of taxonomic and functional turnover in ecological communities. Glob. Ecol. Biogeogr. 31, 1399–1421 (2022).
Johnson, D. J., Condit, R., Hubbell, S. P. & Comita, L. S. Abiotic niche partitioning and negative density dependence drive tree seedling survival in a tropical forest. Proc. R. Soc. B 284, 20172210 (2017).
Stevens, G. C. The latitudinal gradient in geographical range: how so many species coexist in the tropics. Am. Naturalist 133, 240–256 (1989).
Andermann, T., Antonelli, A., Barrett, R. L. & Silvestro, D. Estimating alpha, beta, and gamma diversity through deep learning. Front Plant Sci. 13, 839407 (2022).
Cardoso, D. et al. Amazon plant diversity revealed by a taxonomically verified species list. Proc. Nat. Acad. Sci. 114, 10695–10700 (2017).
Cayuela, L. et al. Species distribution modeling in the tropics: problems, potentialities, and the role of biological data for effective species conservation. Trop. Conserv. Sci. 2, 319–352 (2009).
Lenoir, J. et al. Local temperatures inferred from plant communities suggest strong spatial buffering of climate warming across Northern Europe. Glob. Change Biol. 19, 1470–1481 (2013).
Ellis, E. C., Antill, E. C. & Kreft, H. All is not loss: plant biodiversity in the Anthropocene. PLoS ONE 7, e30535 (2012).
Kattge, J. et al. TRY plant trait database-enhanced coverage and open access. Glob. Change Biol. 26, 119–188 (2020).
Dengler, J. et al. The Global Index of Vegetation-Plot Databases (GIVD): a new resource for vegetation science. J. Veg. Sci. 22, 582–597 (2011).
Lopez‐Gonzalez, G., Lewis, S. L., Burkitt, M. & Phillips, O. L. ForestPlots.net: a web application and research tool to manage and analyse tropical forest plot data. J. Veg. Sci. 22, 610–613 (2011).
Chytrý, M. Database of Masaryk University Vegetation Research in Siberia. Biodiver. Ecol. 4, 290 (2012).
Schmidt, M. et al. The West African Vegetation Database. Biodiv. Ecol. 4, 105–110 (2012).
Muche, G., Schmiedel, U. & Jürgens, N. BIOTA Southern Africa Biodiversity Observatories Vegetation Database. Biodiver. Ecol. 4, 111–123 (2012).
Revermann, R. et al. Vegetation database of the Okavango Basin. Phytocoenologia 46, 103–104 (2016).
N’Guessan, A. E. et al. Drivers of biomass recovery in a secondary forested landscape of West Africa. Ecol. Manag. 433, 325–331 (2019).
Müller, J. Zur Vegetationsökologie der Savannenlandschaften im Sahel Burkina Fasos (Frankfurt-Main Universität, 2003).
Kearsley, E. et al. Conventional tree height–diameter relationships significantly overestimate aboveground carbon stocks in the Central Congo Basin. Nat. Commun. 4, 2269 (2013).
Djomo Nana, E. et al. Relationship between Survival Rate of Avian Artificial Nests and Forest Vegetation Structure along a Tropical Altitudinal Gradient on Mount Cameroon. Biotropica 47, 758–764 (2015).
Wana, D. & Beierkuhnlein, C. Responses of plant functional types to environmental gradients in the south‐west Ethiopian highlands. J. Trop. Ecol. 27, 289–304 (2011).
Finckh, M. Vegetation Database of Southern Morocco. Biodiver. Ecol. 4, 297 (2012).
Strohbach, B. & Kangombe, F. National Phytosociological Database of Namibia. Biodiver. Ecol. 4, 298–298 (2012).
Samimi, C. Das Weidepotential im Gutu‐Distrikt (Zimbabwe)—Möglichkeiten und Grenzen der Modellierung unter Verwendung von Landsat TM‐5. Vol. 19 (2003).
Černý, T. et al. Classification of Korean forests: patterns along geographic and environmental gradients. Appl. Veg. Sci. 18, 5–22 (2015).
Nowak, A. et al. Vegetation of Middle Asia: the project state of the art after ten years of survey and future perspectives. Phytocoenologia 47, 395–400 (2017).
Liu, H., Cui, H., Pott, R. & Speier, M. Vegetation of the woodland‐steppe ecotone in southeastern Inner Mongolia, China. J. Veg. Sci. 11, 525–532 (2000).
Wang, Y. et al. Combined effects of livestock grazing and abiotic environment on vegetation and soils of grasslands across Tibet. Appl. Veg. Sci. 20, 327–339 (2017).
Bruelheide, H. et al. Community assembly during secondary forest succession in a Chinese subtropical forest. Ecol. Monogr. 81, 25–41 (2011).
Cheng, X.-L. et al. Taxonomic and phylogenetic diversity of vascular plants at Ma’anling volcano urban park in tropical Haikou, China: Reponses to soil properties. PLoS ONE 13, e0198517 (2018).
Hatim, M. Vegetation Database of Sinai in Egypt. Biodiver. Ecol. 4, 303 (2012).
Drescher, J. et al. Ecological and socio-economic functions across tropical land use systems after rainforest conversion. Philos. Trans. R. Soc. Lond. B: Biol. Sci. 371, 20150275 (2016).
Dolezal, J., Dvorsky, M. & Kopecky, M. Vegetation dynamics at the upper elevational limit of vascular plants in Himalaya. Sci. Rep. 6, 24881 (2016).
Borchardt, P. & Schickhoff, U. Vegetation Database of South‐Western Kyrgyzstan—the walnut‐wildfruit forests and alpine pastures. Biodiver. Ecol. 4, 309 (2012).
Wagner, V. Eurosiberian meadows at their southern edge: patterns and phytogeography in the NW Tien Shan. J. Veg. Sci. 20, 199–208 (2009).
von Wehrden, H., Wesche, K. & Miehe, G. Plant communities of the southern Mongolian Gobi. Phytocoenologia 39, 331–376 (2009).
Chepinoga, V. V. Wetland Vegetation Database of Baikal Siberia (WETBS). Biodiver. Ecol. 4, 311 (2012).
Korolyuk, A. et al. Database of Siberian Vegetation (DSV). Biodiver. Ecol. 4, 312–312 (2012).
El-Sheikh, M. A. et al. SaudiVeg ecoinformatics: aims, current status and perspectives. Saudi J. Biol. Sci. 24, 389–398 (2017).
Vanselow, K. A. Eastern Pamirs—a vegetation‐plot database for the high mountain pastures of the Pamir Plateau (Tajikistan). Phytocoenologia 46, 105 (2016).
De Sanctis, M. & Attorre, F. Socotra Vegetation Database. Biodiver. Ecol. 4, 315 (2012).
Chabbi, A. & Loescher, H. W. Terrestrial Ecosystem Research Infrastructures: Challenges and Opportunities (CRC Press, 2017).
Ibanez, T. et al. Structural and floristic diversity of mixed rainforest in New Caledonia: New data from the New Caledonian Plant Inventory and Permanent Plot Network (NC‐PIPPN). Appl. Veg. Sci. 17, 386–397 (2014).
Wiser, S. K., Bellingham, P. J. & Burrows, L. E. Managing biodiversity information: development of New Zealand’s National Vegetation Survey databank. N. Z. J. Ecol. 25, 1–17 (2001).
Whitfeld, T. J. S. et al. Species richness, forest structure, and functional diversity during succession in the New Guinea lowlands. Biotropica 46, 538–548 (2014).
Dengler, J. & Rūsiņa, S. Database dry grasslands in the Nordic and Baltic Region. Biodiver. Ecol. 4, 319–320 (2012).
Biurrun, I., García-Mijangos, I., Campos, J. A., Herrera, M. & Loidi, J. Vegetation-plot database of the University of the Basque Country (BIOVEG). Biodiver. Ecol. 4, 328 (2012).
Vassilev, K., Stevanović, Z. D., Cušterevska, R., Bergmeier, E. & Apostolova, I. Balkan Dry Grasslands Database. Biodiver. Ecol. 4, 330–330 (2012).
Marcenò, C. & Jiménez‐Alfaro, B. The Mediterranean Ammophiletea Database: a comprehensive dataset of coastal dune vegetation. Phytocoenologia 47, 95–105 (2017).
Vassilev, K. et al. Balkan Vegetation Database: historical background, current status and future perspectives. Phytocoenologia 46, 89–95 (2016).
Landucci, F. et al. WetVegEurope: a database of aquatic and wetland vegetation of Europe. Phytocoenologia 45, 187–194 (2015).
Peterka, T., Jiroušek, M., Hájek, M. & Jiménez‐Alfaro, B. European Mire Vegetation Database: a gap‐oriented database for European fens and bogs. Phytocoenologia 45, 291–297 (2015).
De Sanctis, M., Fanelli, G., Mullaj, A. & Attorre, F. Vegetation database of Albania. Phytocoenologia 47, 107–108 (2017).
Willner, W., Berg, C. & Heiselmayer, P. Austrian Vegetation Database. Biodiver. Ecol. 4, 333 (2012).
Apostolova, I., Sopotlieva, D., Pedashenko, H., Velev, N. & Vasilev, K. Bulgarian Vegetation Database: historic background, current status and future prospects. Biodiver. Ecol. 4, 141–148 (2012).
Wohlgemuth, T. Swiss Forest Vegetation Database. Biodiver. Ecol. 4, 340 (2012).
Chytrý, M. & Rafajová, M. Czech National Phytosociological Database: basic statistics of the available vegetation‐plot data. Preslia 75, 1–15 (2003).
Jansen, F., Dengler, J. & Berg, C. VegMV—the vegetation database of Mecklenburg‐Vorpommern. Biodiver. Ecol. 4, 149–160 (2012).
Ewald, J., May, R. & Kleikamp, M. VegetWeb—the national online‐repository of vegetation plots from Germany. Biodiver. Ecol. 4, 173–175 (2012).
Jandt, U. & Bruelheide, H. German vegetation reference database (GVRD). Biodiver. Ecol. 4, 355–355 (2012).
Garbolino, E., De Ruffray, P., Brisse, H. & Grandjouan, G. The phytosociological database SOPHY as the basis of plant socio-ecology and phytoclimatology in France. Biodiver. Ecol. 4, 177–184 (2012).
Dimopoulos, P. & Tsiripidis, I. Hellenic Natura 2000 Vegetation Database (HelNAtVeg). Biodiver. Ecol. 4, 388 (2012).
Fotiadis, G., Tsiripidis, I., Bergmeier, E. & Dimopoulos, P. Hellenic Woodland Database. Biodiver. Ecol. 4, 389 (2012).
Stančić, Z. Phytosociological Database of Non‐Forest Vegetation in Croatia. Biodiver. Ecol. 4, 391 (2012).
Lájer, K. et al. Hungarian Phytosociological database (COENODATREF): sampling methodology, nomenclature and its actual stage. Ann. Botanica Nuova Ser. 7, 197–201 (2008).
Landucci, F. et al. VegItaly: The Italian collaborative project for a national vegetation database. Plant Biosyst. 146, 756–763 (2012).
Casella, L., Bianco, P. M., Angelini, P. & Morroni, E. Italian National Vegetation Database (BVN/ISPRA). Biodiver. Ecol. 4, 404 (2012).
Agrillo, E. et al. Nationwide Vegetation Plot Database—Sapienza University of Rome: state of the art, basic figures and future perspectives. Phytocoenologia 47, 221–229 (2017).
Rūsiņa, S. Semi‐natural Grassland Vegetation Database of Latvia. Biodiver. Ecol. 4, 409 (2012).
Schaminée, J. H. J. et al. Schatten voor de natuur. Achtergronden, inventaris en toepassingen van de Landelijke Vegetatie Databank (KNNV Uitgeverij, 2006).
Kącki, Z. & Śliwiński, M. The Polish Vegetation Database: structure, resources and development. Acta Soc. Bot. Pol. 81, 75–79 (2012).
Indreica, A., Turtureanu, P. D., Szabó, A. & Irimia, I. Romanian Forest Database: a phytosociological archive of woody vegetation. Phytocoenologia 47, 389–393 (2017).
Vassilev, K. et al. The Romanian Grassland Database (RGD): historical background, current status and future perspectives. Phytocoenologia 48, 91–100 (2018).
Aćić, S., Petrović, M., Dajić Stevanović, Z. & Šilc, U. Vegetation database Grassland vegetation in Serbia. Biodiver. Ecol. 4, 418 (2012).
Golub, V. et al. Lower Volga Valley Phytosociological Database. Biodiver. Ecol. 4, 419 (2012).
Lysenko, T., Kalmykova, O. & Mitroshenkova, A. Vegetation Database of the Volga and the Ural Rivers Basins. Biodiver. Ecol. 4, 420–421 (2012).
Prokhorov, V., Rogova, T. & Kozhevnikova, M. Vegetation database of Tatarstan. Phytocoenologia 47, 309–313 (2017).
Šilc, U. Vegetation Database of Slovenia. Biodiver. Ecol. 4, 428 (2012).
Šibík, J. Slovak Vegetation Database. Biodiver. Ecol. 4, 429 (2012).
Kuzemko, A. Ukrainian Grasslands Database. Biodiver. Ecol. 4, 430 (2012).
Cayuela, L. et al. The Tree Biodiversity Network (BIOTREE-NET): prospects for biodiversity research and conservation in the Neotropics. Biodiver. Ecol. 4, 211–224 (2012).
Wagner, V., Spribille, T., Abrahamczyk, S. & Bergmeier, E. Timberline meadows along a 1000 km transect in NW North America: species diversity and community patterns. Appl. Veg. Sci. 17, 129–141 (2014).
Aubin, I., Gachet, S., Messier, C. & Bouchard, A. How resilient are northern hardwood forests to human disturbance? An evaluation using a plant functional group approach. Ecoscience 14, 259–271 (2007).
Sieg, B., Drees, B. & Daniëls, F. J. A. Vegetation and altitudinal zonation in continental West Greenland. Medd. om. Gr.ønland Biosci. 57, 1–93 (2006).
Peet, R. K., Lee, M. T., Jennings, M. D. & Faber-Langendoen, D. VegBank—a permanent, open-access archive for vegetation-plot data. Biodiv. Ecol. 4, 233–241 (2012).
Peet, R. K. et al. Vegetation‐plot database of the Carolina Vegetation Survey. Biodiver. Ecol. 4, 243–253 (2012).
Walker, D. A. et al. The Alaska Arctic Vegetation Archive (AVA‐AK). Phytocoenologia 46, 221–229 (2016).
Peyre, G. et al. VegPáramo, a flora and vegetation database for the Andean páramo. Phytocoenologia 45, 195–201 (2015).
Vibrans, A. C., Sevgnani, L., Lingner, D. V., Gasper, A. L. & Sabbagh, S. The Floristic and Forest Inventory of Santa Catarina State (IFFSC): methodological and operational aspects. Pesqui. Florest. Brasileira 30, 291–302 (2010).
Pauchard, A., Fuentes, N., Jiménez, A., Bustamante, R. & Marticorena, A. In Plant Invasions in Protected Areas (eds Foxcroft, L., Pyšek, P., Richardson, D., Genovesi, P.) (Springer, 2013).
González-Caro, S., Umaña, M. N., Álvarez, E., Stevenson, P. R. & Swenson, N. G. Phylogenetic alpha and beta diversity in tropical tree assemblages along regional-scale environmental gradients in northwest South America. J. Plant Ecol. 7, 145–153 (2014).
Bresciano, D., Altesor, A. & Rodríguez, C. The growth form of dominant grasses regulates the invasibility of Uruguayan grasslands. Ecosphere 5, 1–12 (2014).
Aiba, S.-i & Kitayama, K. Structure, composition and species diversity in an altitude-substrate matrix of rain forest tree communities on Mount Kinabalu, Borneo. Plant Ecol. 140, 139–157 (1999).
Armstrong, A. H., Shugart, H. H. & Fatoyinbo, T. E. Characterization of community composition and forest structure in a Madagascar lowland rainforest. Tropical Conserv. Sci. 4, 428–444 (2011).
Ayyappan, N. & Parthasarathy, N. Biodiversity inventory of trees in a large-scale permanent plot of tropical evergreen forest at Varagalaiar, Anamalais, Western Ghats, India. Biodivers. Conserv 8, 1533–1554 (1999).
Balslev, H., Valencia, R., Paz y Miño, G., Christensen, H. & Nielsen, I. In Forest biodiversity in North, Central and South America, and the Caribbean: research and monitoring (eds. Dallmeier, F. & Comiskey, J. A.) 585–594 (1998).
Bordenave, B. G., Granville, J.-J. D. & Hoff, M. Measurement of species richness of vascular plants in a neotropical rain forest in French Guiana. (1998).
Boyle, T. J. B. & Boontawee, B. CIFOR’s Research Programme on Conservation of Tropical Forest Genetic Resources, 395 (Center for International Forestry Research CIFOR, 1995).
Bunyavejchewin, S., Baker, P. J., LaFrankie, J. V. & Ashton, P. S. Stand structure of a seasonal dry evergreen forest at Huai Kha Khaeng Wildlife Sanctuary, western Thailand. Nat. Hist. Bull. Siam Soc. 49, 89–106 (2001).
Cadotte, M. W., Franck, R., Reza, L. & Lovett-Doust, J. Tree and shrub diversity and abundance in fragmented littoral forest of southeastern Madagascar. Biodivers. Conserv. 11, 1417–1436 (2002).
Cano Ortiz, A. et al. Phytosociological study, diversity and conservation status of the cloud forest in the Dominican Republic. Plants (Basel, Switzerland) 9, 741 (2020).
Chisholm, R. A. et al. Scale-dependent relationships between tree species richness and ecosystem function in forests. J. Ecol. 101, 1214–1224 (2013).
Chu, C. et al. Direct and indirect effects of climate on richness drive the latitudinal diversity gradient in forest trees. Ecol. Lett. 22, 245–255 (2019).
Condit, R. S. et al. Tropical Tree a—Diversity: Results From a Worldwide Network of Large Plots (CABI, 2005).
D’Amico, C. & Gautier, L. Inventory of a 1-ha lowland rainforest plot in Manongarivo, (NW Madagascar). Candollea 55, 319–340 (2000).
Davidar, P., Mohandass, D. & Vijayan, L. Floristic inventory of woody plants in a tropical montane (shola) forest in the Palni hills of the Western Ghats, India. Trop. Ecol. 12, 42–58 (2007).
Davies, S. J. & Becker, P. Floristic composition and stand structure of mixed dipterocarp and heath forests in Brunei Darussalam. J. Trop. Sci. 8, 542–569 (1996).
Duivenvoorden, J. F. Vascular plant species counts in the rain forests of the middle Caquet area. Colomb. Amazon. Biodivers. Conserv. 3, 685–715 (1994).
Ek, R. C. Botanical diversity in the tropical rain forest of Guyana: Botanische diversiteit in het tropisch regenwoud van Guyana. (Met een samenvatting in het Nederlands) (Universiteit Utrecht, 1997).
Galeano, G., Suárez, S. & Balslev, H. Vascular plant species count in a wet forest in the Chocó area on the Pacific coast of Colombia. Biodivers. Conserv. 7, 1563–1575 (1998).
Garrigues, J. P. Action anthropique sur la dynamique des formations végétales au sud de l’Inde (Ghâts occidentaux, Etat du Karnataka, District de Shimoga) (University of Claude Bernard, Lyon I, 1999).
Gastauer, M., Leyh, W. & Meira-Neto, J. A. A. Tree Diversity and Dynamics of the Forest of Seu Nico, Viçosa, Minas Gerais, Brazil. Biodiv. Data J. 3, e5425 (2015).
Helmi, N., Kartawinata, K. & Samsoedin, I. An undescribed lowland natural forest at Bodogol, Gunung Gede Pangrango National Park, Cibodas Biosphere Reserve, West Java, Indonesia. Reinwardtia 13, 33–44 (2009).
Hernández, L., Dezzeo, N., Sanoja, E., Salazar, L. & Castellanos, H. Changes in structure and composition of evergreen forests on an altitudinal gradient in the Venezuelan Guayana Shield. Rev. de. Biol.ía Tropical 60, 11–33 (2012).
Ho, B. C. et al. The plant diversity in Bukit Timah Nature Reserve, Singapore. Gardens’ Bull. Singap. 71, 41–144 (2019).
Hubbel, S. P. & Foster, R. B. In Tropical Rain Forest: Ecology and Management (eds Sutton, S. L., Whitmore, T. C. & Chadwick, S.) 25–41 (Blackwell Scientific Publications,1983).
Kartawinata, K., Samsoedin, I., Heriyanto, M. & Afriastini, J. J. A tree species inventory in a one-hectare plot at the Batang Gadis National Park, North Sumatra, Indonesia. Reinwardtia 12, 145 (2013).
Kiratiprayoon, S. Measuring and monitoring biodiversity in tropical and temperate forests. In: IUFRO Symposium, Chiang Mai (Thailand), 27 Aug-2 (CIFOR, 1994).
KuoJung, C., WeiChun, C., KeiMei, C. & ChangFu, H. Vegetation dynamics of a lowland rainforest at the northern border of the paleotropics at Nanjenshan, southern Taiwan. Taiwan J. Sci. 25, 29–40 (2010).
Lan, G., Zhu, H. & Cao, M. Tree species diversity of a 20-ha plot in a tropical seasonal rainforest in Xishuangbanna, southwest China. J. For. Res. 17, 432–439 (2012).
Lee, H. S. et al. Floristic and structural diversity of 52 hectares of mixed dipterocarp forest in Lambir Hills National Park, Sarawak, Malaysia. J. Trop. Sci. 14, 379–400 (2002).
Linares-Palomino, R. et al. Non-woody life-form contribution to vascular plant species richness in a tropical American forest. Plant Ecol. 201, 87–99 (2009).
Lubini, A. & Mandango, A. Etude phytosociologique et ecologique des forets a Uapaca guineensis dans le nord-est du district forestier central (Zaire). Bull. Jard. Bot. Natl Belg. 51, 231 (1981).
Makana, J.-R., Hart, T. & Hart, J. Forest structure and diversity of lianas and understory treelets in monodominant and mixed stands in the Ituri Forest, Democratic Republic of the Congo. Liana Article Index 20 (1998).
Mansur, M. & Kartawinata, K. Phytosociology of a lower montane forest on Mt. Batulanteh, Sumbawa, Indonesia. Reinwardtia 16, 77 (2017).
Mikoláš, M. et al. Natural disturbance impacts on trade-offs and co-benefits of forest biodiversity and carbon. Proc. R. Soc. B 288, 20211631 (2021).
Mohandass, D. & Davidar, P. Floristic structure and diversity of a tropical montane evergreen forest (shola) of the Nilgiri Mountains, southern India. Trop. Ecol. 50, 219–229 (2009).
Monge González, M. et al. BIOVERA-Tree: tree diversity, community composition, forest structure and functional traits along gradients of forest-use intensity and elevation in Veracruz, Mexico. Biodiv. Data J. 9, e69560 (2021).
Ngo, K. M., Davies, S., Nik, H., Faizu, N. & Lum, S. Resilience of a forest fragment exposed to long-term isolation in Singapore. Plant Ecol. Diver. 9, 397–407 (2016).
Parthasarathy, N. Tree diversity and distribution in undisturbed and human-impacted sites of tropical wet evergreen forest in southern Western Ghats, India. Biodivers. Conserv. 8, 1365–1381 (1999).
Parthasarathy, N. & Karthikeyan, R. Biodiversity and population density of woody species in a tropical evergreen forest in Courtallum reserve forest, Western Ghats, India. Trop. Ecol. 38 (1997).
Pascal, J. P. Wet Evergreen Forests of the Western Ghats of India (Institut français de Pondichéry, 1988).
Pascal, J. P. & Pelissier, R. Structure and floristic composition of a tropical evergreen forest in south-west India. J. Trop. Ecol. 12, 191–214 (1996).
Phillips, O. L. et al. Efficient plot-based floristic assessment of tropical forests. J. Trop. Ecol. 19, 629–645 (2003).
Proctor, J., Anderson, J. M., Chai, P. & Vallack, H. W. Ecological Studies in Four Contrasting Lowland Rain Forests in Gunung Mulu National Park, Sarawak: I. Forest Environment, Structure and Floristics. J. Ecol. 71, 237 (1983).
Ramesh, B. R. et al. Forest stand structure and composition in 96 sites along environmental gradients in the central Western Ghats of India. Ecology 91, 3118 (2010).
Razak, S. A. & Haron, N. W. Phytosociology of Aquilaria Malaccensis Lamk. and its communities from a tropical forest reserve in peninsular Malaysia. Pak. J. Bot. 47, 2143–2150 (2015).
Romoleroux, K. et al. Especies leñosas (dap= 1 cm) encontradas en dos hectáreas de un bosque de la Amazonía ecuatoriana. Estudios sobre diversidad y ecología de plantas, 189–215 (1997).
Sarah, A. R., Nuradnilaila, H., Haron, N. W. & Azani, M. A Phytosociological Study on the Community of Palaquium gutta (Hook. f.) Baill.(Sapotaceae) at Ayer Hitam Forest Reserve, Selangor, Malaysia. Sains Malaysiana 44, 491–496 (2015).
Schrader, J., Moeljono, S., Tambing, J., Sattler, C. & Kreft, H. A new dataset on plant occurrences on small islands, including species abundances and functional traits across different spatial scales. Biodiv. Data J. 8, e55275 (2020).
Sheil, D., Kartawinata, K., Samsoedin, I., Priyadi, H. & Afriastini, J. J. The lowland forest tree community in Malinau, Kalimantan (Indonesian Borneo): results from a one-hectare plot. Plant Ecol. Diver. 3, 59–66 (2010).
Sukumar, R. et al. Long-term monitoring of vegetation in a tropical deciduous forest in Mudumalai, southern India. Curr. Sci. 62, 608–616 (1992).
van Andel, T. R. Floristic composition and diversity of three swamp forests in northwest Guyana. Plant Ecol. 167, 293–317 (2003).
Webb, E. L. & Fa’aumu, S. Diversity and structure of tropical rain forest of Tutuila, American Samoa: effects of site age and substrate. Plant Ecol. 144, 257–274 (1999).
Zimmerman, J. K. et al. Responses of Tree Species to Hurricane Winds in Subtropical Wet Forest in Puerto Rico: Implications for Tropical Tree Life Histories. J. Ecol. 82, 911 (1994).
Olson, D. M. et al. Terrestrial ecoregions of the worlds: a new map of life on Earth. Bioscience 51, 933–938 (2001).
Karger, D. N. et al. Climatologies at high resolution for the earth’s land surface areas. Sci. Data 4, 170122 (2017).
Hengl, T. et al. SoilGrids250m: Global gridded soil information based on machine learning. PLoS ONE 12, e0169748 (2017).
Amatulli, G. et al. A suite of global, cross-scale topographic variables for environmental and biodiversity modeling. Sci. Data 5, 180040 (2018).
Sandel, B. et al. The influence of Late Quaternary climate-change velocity on species endemism. Science 334, 660–664 (2011).
Schultz, J. The Ecozones of the World (Springer, 2005).
Körner, C. et al. A global inventory of mountains for bio-geographical applications. Alp. Bot. 127, 1–15 (2017).
Elith, J., Leathwick, J. R. & Hastie, T. A working guide to boosted regression trees. J. Anim. Ecol. 77, 802–813 (2008).
Hijmans, R. J., Phillips, S., Leathwick, J. & Elith, J. Package ‘dismo’. Available online at: http://cran.r-project.org/web/packages/dismo/index.html (2011).
Zhou, S. et al. Estimating stock depletion level from patterns of catch history. Fish. Fish. 18, 742–751 (2017).
Rocchini, D. et al. Accounting for uncertainty when mapping species distributions: the need for maps of ignorance. Prog. Phys. Geogr. 35, 211–226 (2011).
Potapov, P., Laestadius, L. & Minnemeyer, S. Global map of potential forest cover www.wri.org/forest-restoration-atlas (2011).
Tuanmu, M. N. & Jetz, W. A global 1‐km consensus land‐cover product for biodiversity and ecosystem modelling. Glob. Ecol. Biogeogr. 23, 1031–1045 (2014).
Roberts, D. R. et al. Cross-validation strategies for data with temporal, spatial, hierarchical, or phylogenetic structure. Ecography 40, 913–929 (2017).
Pebesma, E. & Heuvelink, G. Spatio-temporal interpolation using gstat. RFID J. 8, 204–218 (2016).
R Development Core Team. R: A language and environment for statistical computing v.3.6.1. R Foundation for Statistical Computing http://www.R-project.org/ (2019).
South, A. rnaturalearth: World Map Data from Natural Earth v.0.1.0. R package https://CRAN.R-project.org/package=rnaturalearth (2017).
Sabatini, F. M. et al. Global patterns of vascular plant alpha-diversity [Dataset]. iDiv Data Repository. https://doi.org/10.25829/idiv.3506-p4c0mo (2022).
Sabatini, F. M. fmsabatini/GlobalLocal\_PlantRichness: NatComms R3 v.3. Zenodo https://doi.org/10.5281/zenodo.6659837 (2022).
Acknowledgements
The authors are grateful to thousands of vegetation scientists who sampled vegetation plots in the field or digitized them into regional, national or international databases. We appreciate the support of the German Research Foundation for funding sPlot as one of the iDiv research platforms (DFG FZT 118, 202548816). F.M.S. also acknowledges financial support within the funding programme Open Access Publishing by the German Research Foundation (DFG), and within the Rita-Levi Montalcini (2019) programme, funded by the Italian Ministry of University. In our analyses, we used the iDiv High-Performance Computing (HPC) cluster, for which we in particular acknowledge the support of Christian Krause. M.C. was supported by the Czech Science Foundation (project no. 19-28491X). V.D.P. was supported by the Brazilian National Research Council (CNPq grant 307689/2014-0). P.P. was supported by the long-term research development project No. RVO 67985939 of the Czech Academy of Sciences. A.P. gratefully acknowledges the Grant CONICYT PIA AFB170008 and Grant ANID PIA/BASAL FB210006. S.K.W. was supported by the Strategic Science Investment Fund of the NZ Ministry for Business, Employment and Innovation. A.G.-D.-M. acknowledges for the support of Agencia Española de Cooperación Internacional para el Desarrollo, Universidad CEU San Pablo (Madrid, Spain) and Universidad Privada Antonio Guillermo Urrelo (Cajamarca, Peru). H.K. acknowledges funding from the German Research Foundation (DFG) in the framework of the EFForTS project (Collaborative Research Centre 990. B.H. & J.K.N.’D. acknowledge the DynRecSe Project (C2D AMRUGE). E.A.-D. acknowledges the Red Col-Tree (Red de Monitoreo del bosque en Colombia). Ji.D. was funded by MŠMT Inter-excellence LTAUSA 18007, Czech Science Foundation (21-26883S). S.K. acknowledges funds by the 2019–2020 BiodivERsA joint call for research proposals, under the BiodivClim ERA-Net COFUND program (FeedBaCks, 193907). R.L.P. acknowledges funds by GeoPark Perú. J.J. acknowledges funds from various projects of Senacyt, Fonacon and Usac. O.P., A.M., and R.V. acknowledge support from the UK Department for International Development, a Research Fellowship to OP from the UK Natural Environment Research Council, the Mellon and MacArthur Foundations’ support to the Missouri Botanical Garden, and the European Research Council for enabling our ForestPlots.net contribution to this project. C.V. was partly supported by the Fondation pour la Recherche sur la Biodiversité (FRB) and Electricité de France (EDF) in the context of the CESAB project ‘Causes and consequences of functional rarity from local to global scales’ (FREE). A full list of funding sources for each dataset listed in Supplementary Data 1 is available in Bruelheide et al.15.
Funding
Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Contributions
F.M.S. and H.B. conceived the idea, with inputs by B.J.A. and U.J. F.M.S. performed the analysis. F.M.S. drafted the first version of the manuscript. B.J.A., H.B., U.J., M.C., V.D.P., R.F., S.H., M.K., J.L., F.S., Z.T., P.v.B. and S.K.W. provided substantial input on the manuscript and analytical framework. B.J.A., M.A.K., F.A, L.C., M.C., Jü.D., M.D.S., R.F., M.H., A.I., U.J., F.J., M.K., J.L., A.P., R.K.P., P.P., B.S., M.S., K.V., C.V., S.K.W., E.A.-D., P.D., Ji.D., B.H., A.G.-D.-M., J.J., S.K., S.K.-R., H.K., F.L., R.L.-P., J.K.N.’D., A.M.M., O.L.P., G.R.-T., P.S., K.S., B.J.S., R.V.M., H.-F.W., and K.W. provided parts of the data. All co-authors edited the manuscript and provided suggestions on how to improve the analyses.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks Tarciso Leão and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Source data
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Sabatini, F.M., Jiménez-Alfaro, B., Jandt, U. et al. Global patterns of vascular plant alpha diversity. Nat Commun 13, 4683 (2022). https://doi.org/10.1038/s41467-022-32063-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-022-32063-z
This article is cited by
-
Climate-trait relationships exhibit strong habitat specificity in plant communities across Europe
Nature Communications (2023)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.