Using species distribution modeling to delineate the botanical richness patterns and phytogeographical regions of China

Zhang, Ming-Gang; Slik, J. W. Ferry; Ma, Ke-Ping

doi:10.1038/srep22400

Download PDF

Article
Open access
Published: 01 March 2016

Using species distribution modeling to delineate the botanical richness patterns and phytogeographical regions of China

Ming-Gang Zhang^1,2,
J. W. Ferry Slik³ &
Ke-Ping Ma¹

Scientific Reports volume 6, Article number: 22400 (2016) Cite this article

6633 Accesses
44 Citations
Metrics details

Subjects

Abstract

The millions of plant specimens that have been collected and stored in Chinese herbaria over the past ~110 years have recently been digitized and geo-referenced. Here we use this unique collection data set for species distribution modeling exercise aiming at mapping & explaining the botanical richness; delineating China’s phytogeographical regions and investigating the environmental drivers of the dissimilarity patterns. We modeled distributions of 6,828 woody plants using MaxEnt and remove the collection bias using null model. The continental China was divided into different phytogeographical regions based on the dissimilarity patterns. An ordination and Getis-Ord Gi* hotspot spatial statistics were used to analysis the environmental drivers of the dissimilarity patterns. We found that the annual precipitation and temperature stability were responsible for observed species diversity. The mechanisms causing dissimilarity pattern seems differ among biogeographical regions. The identified environmental drivers of the dissimilarity patterns for southeast, southwest, northwest and northeast are annual precipitation, topographic & temperature stability, water deficit and temperature instability, respectively. For effective conservation of China’s plant diversity, identifying the historical refuge and protection of high diversity areas in each of the identified floristic regions and their subdivisions will be essential.

Spatial patterns of vascular plant species richness in Poland - a data set

Article Open access 18 August 2023

Native range estimates for red-listed vascular plants

Article Open access 29 March 2022

Evaluation metrics and validation of presence-only species distribution models based on distributional maps with varying coverage

Article Open access 15 January 2021

Introduction

In the past ~110 years, millions of specimens have been collected and stored in Chinese herbaria. Over 11,400 woody plants have been identified¹, with about 7,000 of these (c. 61%) endemic². Based on the distribution information of the specimens, efforts have been made to map and explain the geographical variation in plant species richness across China^3,4, identifying both centers of endemicity² and refugia⁵. Explaining the large scale geographical variation in plant species richness is one of the key issues in ecology and the efforts have typically focused on effects of either ‘history’ or ‘ecology’⁶. Some ‘history’ related hypotheses state that tropical areas have higher rates of speciation and that at higher latitudes most tropical species cannot tolerate the abiotic stress (e.g.: cold temperature, water deficit and extreme seasonality) due to evolutionary constraints on their niche breadth⁷. Other historical hypotheses propose that in habitats with a stable environment species have enough time to disperse and adapt, thus the diversity patterns reflect a steady-state relationship between the environment and the evolutionary processes⁸. Alternatively, the ‘ecology’ related hypotheses initially focus on the productivity of terrestrial environments. The productivity of ecosystems is initially considered to be determined by energy or the combined effects of water availability and environmental energy⁹. The strong environmental gradients, complex topography and a long geological history make China a ‘natural lab’ for testing these hypotheses¹⁰.

The geographical patterns of life on earth are shaped by past and current physical and biological forces and can be summarized by biogeographical regionalizations¹¹. When exploring basic and applied questions in biogeography, evolutionary biology, systematics, ecology and conservation, these biogeographical regionalizations provide an indispensable background against which to interpret research findings^11,12. In recent years, with the increasing availability of large databases on species distributions, the importance of robust systems for classifying biogeographical patterns has been emphasized^8,11. For China, the most comprehensive biogeographical regionalization studies were conducted by Wu¹³, who divided China into 4 major floristic regions, 7 sub-regions, 27 areas and 49 sub-areas. However, these studies lacked high-resolution data on species-level distributions and the robust statistical basis that is available today. Wu et al.’s study¹³ also lacked information on the floristic relationships between the identified regions.

Former macro-scale studies were generally based on the raw survey data, which can give rise to several problems. One of these is the geographical collection bias: Most regions of China have incomplete collection effort¹⁴, also termed ‘inventory incompleteness’ leading to incomplete information on actual ranges for many species¹⁵. At the same time, the available collection localities generally do not represent random samples from the available environmental gradients, with most collections coming from sites near roads, in protected areas, or areas of known high plant diversity or special species composition¹⁶. Furthermore, for many regions in China, the original vegetation has disappeared and been replaced by agricultural crops, planted forests, or urbanization. Current collections from these sites thus do no longer represent the actual potential species composition¹⁷. Thus, when using large species collection databases, statistical procedures that take these problems into account are critical for obtaining accurate diversity and species distributional maps.

Species distribution modeling is now widely used to determine distribution patterns at large spatial scales¹⁸. Species distribution models attempt to identify and map the suitable habitat range of species by combining environmental predictors with presence records^19,20. Herbarium collection databases generally provide the primary presence data for such analyses. Currently, most specimens available in Chinese herbaria have been digitalized and are available online¹⁵. Meanwhile, much progress has been made in species distribution modeling, such as dealing with small sample sizes^21,22 and overcoming collection bias using null model tests^23,24. In this study, we aim to (1) use species distribution modeling to map and explain the spatial distribution of the woody plant diversity in mainland China; (2) use the obtained presence/absence data to delineate the phytogeographical regions of China and their relationships at genus and species level; (3) detect the effect of the uncorrelated environmental predictors on the dissimilarity patterns of species.

Results

Of the 34,230 grid cells covering China, 3,068 (9%) have reliably georeferenced plant collections available (Fig. S1). These collecting localities were environmentally biased for 14 of the 15 environmental predictors (Fig. S2). Of the 6,828 modeled species, 6,560 showed significant non-random habitat associations when tested against a collection bias corrected null model (Fig. S3). After stacking of the SDMs of these 6,560 species south-central China showed the highest woody plant diversity (Fig. 1). Mountain ranges in southern China were especially high in diversity, including the Qinling Mountains (28), Daba Mountains (30), Gaoligong Mountains (39), Wuling Mountains (50), Dalou Mountains (52), Daiyun Mountains (56), Nanling Mountains (61), Dayao Mountains (62) and Yunwu Mountains (63). In variation partitioning analysis (Fig. S5), the environmental predictors explained 11% of the species richness independently, while the explanatory power of environmental predictors and spatial predictors combined was 69%. Due to the weak explanatory power (1%) of spatial predictors alone, we only used the environmental predictors to explain the species richness patterns. Among the best predictors of species richness in China were: Annual rainfall (positive), Temperature annual range (negative), Cation-Exchange-Capacity of the clay subsoil (negative) and the topsoil C:N-ratio (positive), with the most important environmental variable (highest β-value) being temperature annual range (Table 1).

Table 1 Result of stepwise multiple regressions between current species richness and environmental predictors.

Full size table

With cluster analysis at genus level, seven phytogeographical regions belonging to four major groupings were identified in continental China (Fig. 2). At species level eleven phytogeographical regions were identified belonging to five major groupings and the major groups were segmented by five dividing lines (I, II, III, IV, V in Fig. 2). The order of the major groupings differed between the genus and species level analyses (Fig. 2). In the relative environmental turnover analysis, the stress value (0.177) suggests an acceptable fit of the environmental data to the clusters in the NMDS ordination (Fig. 3). At continental level, the species turnover towards northeast China correlated highly with temperature annual range and soil drainage class. The species turnover towards northwest China correlated with precipitation seasonality. The species turnover towards southwest China was highly correlated with elevation range and isothermality. The species turnover towards southeast China strongly correlated with annual precipitation (Fig. 3). The Gi* statistic showed that temperature in northeast China is highly unstable (lowest Isothermality and highest temperature annual range), while the temperature in southwest China is most stable. Northwest China is limited by low temperatures and moisture (lowest annual mean temperature and annual precipitation). Compared with other regions, southeast China has the best moisture conditions (highest annual precipitation) (Table 2).

Table 2 Gi* statistic results for the grid cells comprising the 11 woody plants floristic regions of China.

Full size table

Discussion

Botanical richness patterns and their response to the environment

As a country famous for its extremely high species diversity, mapping the geographic patterns of species richness across China and detecting the relationship between species richness and environments is critical to help conserve the biodiversity across the country. Former studies highlight the effect of mean temperature of the coldest quarter¹⁰ and temperature seasonality in shaping the current species diversity patterns in China. Based on the 6,560 significantly nonrandom distributed SDMs, our results suggest the most important variable is annual precipitation. This variable can explain most of richness pattern when tested alone. Meanwhile, their positive correlation suggests that highest richness values are found under relatively high and stable moisture conditions. Temperature annual range explained most of the variance in species richness; their negative correlation suggests that temperature stability is the driving factor for the observed patterns in species richness. Apart from this, the richness pattern showed a limited correlation with soil factors, similar with the former research for woody species in Europe²⁵. However, a weak negative correlation between species diversity and soil fertility can still be found in our study.

Water availability is considered as a strong promoter of species richness by promoting ecosystem productivity^26,27. Normally, it is used to explain the diversity pattern associated with energy²⁸ and their relative importance was thought to be regional specificity²⁹. Water availability was thought to be more important when there is no limitation of energy input³⁰. In southern Africa, O’Brien, et al.³¹ found that the woody plant diversity was most strongly correlated with water availability. In lowland and montane Neotropical forests, water availability was identified as the dominant predictor of liana species richness and wetter forests had a greater species richness of all woody plants³². At global scale, Kreft, et al.³³ found that the water availability is a much stronger constraint of pteridophyte and seed plant richness and humid tropical mountainous regions are conserving most of the pteridophyte richness.

The role of temperature stability in shaping the plant richness pattern has been found by several studies in different continents. In Borneo, the highest species diversity values of woody plants were found under relative stable annual temperature conditions³⁴. In South Africa, the high speciation and low extinction rate were proved to be promoted by temperature stability³⁵. In Australia, the regions with highest species diversity and endemism were also proved have the highest environmental stability³⁶. Recently, many studies even noticed that not only the contemporary climatic stability but the historical climatic stability perform well in predicting the species diversity patterns³⁷. For the Amazon, regional diversity is strongly correlated with historical climatic stability³⁸, while climatic instability is thought to increase extinction rates^39,40. The strong association between species richness and glacial-interglacial climate stability were also found for plants in Europe⁴¹. For palm in tropical rain forests, recent research found that historical climatic stability is the main driver for regional species pool, thus promote the local species richness⁴². This finding could be supported by the historical refuga hypotheses in macro-ecology, but the mechanism still need further test in future studies.

Phytogeographical regions of China and their linkage to the environment

The most comprehensive study on biogeographical regionalization of Chinese plants was carried out by Wu et al. (2010). Our 11 biogeographical sub-regions are in broad agreement with Wu’s definition based on qualitative evidence by experts. However, some differences can still be found between our analysis and Wu’s division: (i) Wu’s Paleotropical zone was divided into two parts by a major split (Line V in Fig. 2D) and being separated into Tibetan Plateau region and Southeast China. Meanwhile, a recent study redefined a larger Paleotropical zone based on the genus distribution and suggested that the line at c. 22°30′N is the reasonable northern biogeographical boundary of the tropical zone⁴³, which can roughly match our division at genus level; (ii) A sharp change on species composition happened in the middle of Wu’s Sino-Japanese floristic region. The major split (Line I in Fig. 2D) strongly match the 0 °C isotherm in January and it is the traditional division of subtropical and warm temperate zones of China; (iii) No clear Eurasian Steppe region can be identified based on the distribution of woody species; this is no surprise that most woody species are coming from the nearby forest. But the genus composition is still quite different from other areas; (iv) The Tianshan and West Kunlun mountain area is identified as a major biogeographical region instead of being separated into Tibetan Plateau region and Central Asian desert region. Additionally, all of the 11 biogeographical sub-regions can be grouped into 5 major biogeographical regions, except the Tianshan and West-Kunlun mountain regions, other four major regions are in agreement with another comprehensive study on biogeographical regionalization that based on the distribution of 509 higher plants⁴⁴.

The NMDS analysis was an efficient tool in identifying major biogeographical splits in continental China, but less sensitive in identifying the sub-regions¹¹. Meanwhile, our analyses confirm the hypothesis that the patterns of species turnover are promoted by the changes in environmental conditions¹² and the mechanisms causing dissimilarity pattern may differ among biogeographical regions⁴⁵. Southeast China is a typical Monsoon tropical & subtropical region. This is a region with the best moisture conditions and weak limitation from energy input. The precipitation has the gradient from southeast coast to inland, which can be mirrored in the species dissimilarity pattern. This finding can be well supported in a study based on the plots survey in montane forests of China⁴⁵. The Southwest China has a distinct isolation from the Southeast Monsoon area due to the geographical barriers formed by the sharp elevating altitude from east to west. The most special character of this region is that the altitude elevating from the south to the north. Meanwhile, due to lower latitudes and higher elevations, the whole regions shown the most stable temperature conditions, which also varies with the altitude gradient together with the dissimilarity pattern of woody plants. Moving to the northern China, the environmental factors with the largest variation is the temperature annual range. The whole region is suffering from the limited energy input and the dissimilarity pattern of woody plants changes with the increased instability of temperature from south to north. Northwest China is strongly influenced by water input and the landscape and species composition of this region show great difference with other part of China. Rather than total rainfall, our analysis showed that the seasonal variation of rainfall is the main driver for the dissimilarity pattern. This result accordance with a recent study conducted in the central deserts in Australia¹².

Conclusions

For China, the water availability and stability of temperature were identified as the main divers of the contemporary species diversity pattern. The two variables explaining most of the variance of the species richness were comparable to the macro-ecological diversity studies in other regions. Based on the high resolution of species level data and the robust method for classifying biogeographical regions, eleven biogeographical regions and five major divisions were successfully recognized. Different driving forces for the dissimilarity pattern of plants for the five major divisions were identified: the precipitation in southeast China; the elevating of altitude and stable temperature in southwest China; the instability of temperature in northeast China and the water deficit in northwest China. Our study found that for effective conservation of China’s plant diversity, the areas with adequate precipitation and most stable temperature need more attention; the importance of historical refuga needs further test. More importantly, the high diversity areas in each of the identified floristic regions and their subdivisions should be considered with special priority for plant conservation.

Methods

Species data

We obtained collection information for c. 4.5 million specimens of Chinese vascular plants present in 42 major herbaria from the Chinese Virtual Herbarium (http://www.cvh.org.cn/cms/en/, accessed September 2012). After removing all non-woody species, 1.1 million specimens (trees, shrubs and lianas) remained. However, most of these had no latitude and longitude information. These collections were georeferenced using the location descriptions as provided on the labels. This resulted in 464,045 specimens with accurate location data. Subsequently, species presences were scored in 10 arc-minute grid cells, avoiding duplicate species records in each grid cell. We used the 10 arc-minute spatial resolutions because this corresponded to the environmental data resolution (WorldClim), but more importantly, because a higher resolution was not possible due to the spatial error in the georeferenced specimen data. Species that were present in fewer than 5 grid cells were removed from the analysis. Of the 464,045 georeferenced specimens, 371,712 records belonging to 157 plant families representing 6,828 species were finally kept to be modeled.

Environmental predictors

Initially, 35 environmental predictors were selected to model the species distribution patterns. These included 19 bioclimatic predictors (1950–2000) plus altitude of the WORLDCLIM dataset (www.worldclim.org) for China at 10 arc-minute resolution and 15 soil variables selected from the FAO database for poverty and insecurity mapping⁴⁶. The FAO soil properties had a spatial resolution of 5 arc-minute, so we re-sampled all soil layers into 10 arc-minute grid cells. The whole mainland of China was covered by 34,230 grid cells with a resolution of 10-arc minutes.

Because multi-colinearity of variables can result in over-fitting in species distribution modeling^22,47, we removed highly correlated environmental predictors. For both bio-climatic and soil predictors, we used spearman’s rank correlation tests to select the least correlated variables (spearman’s rho < 0.75). From correlated variables with Spearman rho higher than 0.75 only the ecologically most meaningful factors were kept. This procedure eventually resulted in the following climatic variables being included in further analyses: (1) Bio01: Annual Mean Temperature; (2) Bio03: Isothermality (P2/P7) × 100 (P2: Mean Diurnal Rang; P7: Temperature Annual Range); (3) Bio07: Temperature Annual Range; (4) Bio12: Annual Precipitation; (5) Bio15: Precipitation Seasonality (Table S1). Of the soil predictors the following variables were included in the analysis: (1) BS-T: base saturation% topsoil; (2) CE-S: CEC clay subsoil (CEC = cation exchange capacity); (3) CN-T: Carbon (C) : Nitrogen (N) ratio class topsoil; (4) CP-T: organic carbon pool topsoil; (5) Depth: effective soil depth; (6) Drain: soil drainage class; (7) NN-T: nitrogen% topsoil; (8) Prod: soil production index; (9) Text.: textural class subsoil (Table S2). In total, 15 of the original 35 predictors were kept to model the species distributions.

Species distribution model building and collection bias correction

In order to model species distributions we used the modeling application Maxent (ver. 3.3.3k; www.cs.princeton.edu/~schapire/maxent/)²⁰. Maxent was specifically developed to model species distributions with presence-only data. Of available species distribution modeling algorithms, Maxent has been shown to perform best, especially when few presence records are available⁴⁸, while it is also the least affected by location errors in occurrences⁴⁹. Maxent was run with the following modeling rules: (1) for species with 5–10 collection records linear features were applied, (2) for species with 10–14 records quadratic features were applied, while (3) for species with >15 records hinge features were applied²³.

As a measure of the accuracy of the SDMs, we used the threshold independent and prevalence insensitive area under the curve (AUC) of the receiver operating characteristic (ROC) plot produced by Maxent. All measures of SDM accuracy require absences⁵⁰. When these are lacking, as is the case here, they are replaced by pseudo-absences or sites randomly selected at localities where no species presence was recorded²⁰. However, when SDM accuracy measures are based on presence-only data and pseudo-absences, the standard measures of accuracy (e.g. the often used measure AUC >0.7) do not apply^23,34. Therefore, we applied the bias corrected null-model developed by Raes and ter Steege (2007) to test the AUC value of an SDM developed with all presence records against the AUC values expected by chance. However, this assumes that collection localities represent a random subset of the study areas environmental space. In many cases this is not a valid assumption due to collecting biases^51,52.

To check for collecting bias in our dataset we tested whether our 3,068 collection localities were random subsamples of the environmental predictor space. To do this we divided each of the 16 environmental predictors into 10 equal-interval bins based on the ranges observed for China (34,230 grid cells)⁵³. We then tested whether the observed frequency distributions represented by the 3,068 collection localities differed from those observed for whole China using a Chi-square test. This showed that for 14 of the 15 environmental predictors, our collection locations represented non-random subsamples of China’s environmental predictor space. To correct for this bias we developed a bias corrected null model by testing each species model AUC value against 1,000 AUC values that were generated randomly by subsampling from all the available collection localities only. When the observed AUC value fell in the top 95% of randomly generated AUC values, it was considered to have a significant non-random distribution and was used in our further analyses. For all the 6,828 available species of China, 6,560 species showed a significantly non-random distribution (AUC value ≥ 95% C.I.).

Species diversity patterns and species turnover index

To define whether a species is present or absent in a grid cell, we need to select a threshold for Maxent prediction values. For species represented by 5–9 records we used either the ‘sensitivity specificity equality’ or the ‘sum maximization’ threshold, For SDMs represented by ≥10 records we used the fixed ‘10 percentile presence’ threshold. Once the threshold is set, a series of presence/absence layers of all the species becomes available. In the next step, a presence/absence matrix was created, the rows representing the 34,230 grid cells covering China and the columns representing the presence of the 6560 modeled species. Species richness was then mapped in ArcGIS 9.3.

In order to explain the species diversity patterns, we performed variation partitioning analysis. This method is especially useful when two sets (environmental and spatial predictors) of independent explanatory variables are involved in explaining the variation of an ecologically dependent variable. The nine terms of the trend-surface analysis regression equation were used to describe the spatial pattern in current species richness⁵⁴:

where LON refers to longitude and LAT refers to latitude. Environmental predictors were formed by contemporary climate predictors and soil predictors. The explanatory power (R_adj²) of groups of variables (Environmental predictors, spatial predictors, combined predictors) was tested using the free software ‘spatial Analysis in Macroecology’ (SAM)⁵⁵. Additionally, we used the same software to obtain the Moran’s I values to test for spatial autocorrelation (RSA) in model residuals. RSA may result from spatially structured biological processes, which violates the statistical independence of observations and may inflate the type I errors⁵⁶. Influential environmental predictors for botanical richness patterns were identified by performing stepwise multiple regressions.

In order to identify phytogeographical regions we applied cluster analysis. For this the pair wise floristic distance between grid cell assemblages need to be measured using an appropriate metric. The most frequently used methods to measure the floristic distance between grid cell assemblages includes Sørensen/Bray–Curtis, Jaccard and Kulczynski, but these are all strongly affected by differences in species richness. This means that a change in community composition has a greater relative influence in relatively species-poor than in diverse assemblages and if there is a large difference in richness between grid cells, the obtained values from these indices will also be large. Kreft and Jetz¹¹ argued that for the purpose of biogeographical regionalizations richness-independent turnover is more informative and therefore suggest the use of metrics that are least affected by the variation in richness. Therefore we used the beta-sim index based on distribution data at genus and species level:

where a = the number of species shared between two grid cells and b and c represent the number of species unique to each grid cell. The value ranges from 0 to 1, where 0 means pairs of grid cells have identical taxa lists and 1 means that they share no taxa.

Bioregionalization based on species compositional dissimilarity

We used the UPGMA method, which is frequently favoured and proven to perform among the best in floristic cluster analysis^11,57. Determining the optimum number of final cluster groups is a long standing issue in biogeographical applications⁴⁵. The indicator species analysis⁵⁸ calculates the indicator taxa at each level of bifurcation in the cluster dendrogram and determines how well they fit the clusters (based on p values)⁵⁹. The significance of the assignment to a cluster group is determined with a Monte-Carlo test using 1000 randomizations. For the cluster analysis, we used summed p-values for 2–30 cluster groups to detect the optimum level of grouping. We randomly drew 1000 grid cells for each analysis and repeated the procedure five times. We found that the lowest summed p-value was reached at eleven clusters, so we used this as the optimum level of grouping. We then performed the cluster analysis on the complete dissimilarity matrix, setting the cluster group level as 11. The indicator species analysis was performed using PCORD 5.0.

Ordination and relative environmental turnover (RET)

Ordination is a widely used technique to produce low dimensional projections of multivariate data by arranging objects (in our case grid cell assemblages) along reduced axes based on taxonomic composition. We performed a NMDS ordination at species level, using the ‘metaMDS’ function of the vegan library⁶⁰ in the statistical software R⁶¹. Pairwise distances were calculated using . One hundred random start points were used to find a stable solution and to avoid local minima¹¹. The stress value reflects the amount of error in the correlation between pairwise distances in the original data matrix and a data matrix calculated from the NMDS plot, with 0 representing no error and 1 indicating a complete lack of correlation.

Once we got the NMDS results, the relative environmental turnover (RET) method was used to investigate the relationship between environmental predictors and the floristic regions of species turnover⁶². RET contains two types of analyses, one that matches the NMDS results with the environmental variables and a second that is based on the grid cell Getis-Ord Gi* hotspot statistic. First, all environmental variables included in species distribution modeling were re-sampled into 20 × 20 arc-minute and an environmental variable matrix was created. Secondly, we fitted NMDS results with the uncorrelated environmental variable matrix using the vector fitting of the envfit function from the vegan package in R statistical software. Thirdly, environmental variables that were significantly related to the patterns of turnover (P < 0.001, 999 permutations) were mapped onto the NMDS ordination plot as vectors.

In the final step, the Getis-Ord Gi* hotspot statistic was used to assess whether the environmental values within each phytogeographical region were significantly different from those of whole continental China⁶³. Here, the clusters with Gi* values >2 or <−2 represent sets of cells that have environmental values significantly different from expected. The environmental layer processing and Getis-Ord Gi* hotspot statistic were performed in ARC-MAP 9.3.

Additional Information

How to cite this article: Zhang, M.-G. et al. Using species distribution modeling to delineate the botanical richness patterns and phytogeographical regions of China. Sci. Rep. 6, 22400; doi: 10.1038/srep22400 (2016).

References

Fang, J. Y., Wang, Z. H. & Tang, Z. Y. Atlas of woody plants in China: distribution and climate. Springer and Higher Education Press. (2011).
Huang, J. H. et al. Identifying hotspots of endemic woody seed plant diversity in China. Divers. Distrib. 18, 673–688 (2012).
Google Scholar
Wang, Z. H., Fang, J. Y., Tang, Z. Y. & Shi, L. Geographical patterns in the beta diversity of China’s woody plants: the influence of space, environment and range size. Ecography 35, 1092–1102 (2012).
Google Scholar
Qian, H. Environmental Determinants of Woody Plant Diversity at a Regional Scale in China. Plos One 8, e75832 (2013).
ADS CAS PubMed PubMed Central Google Scholar
Lopez-Pujol, J., Zhang, F.-M., Sun, H.-Q., Ying, T.-S. & Ge, S. Centres of plant endemism in China: places for survival or for speciation? J. Biogeogr. 38, 1267–1280 (2011).
Google Scholar
Brown, J. H. Why are there so many species in the tropics? J. Biogeogr. 41, 8–22 (2014).
PubMed Google Scholar
Wiens, J. J. & Graham, C. H. In Annual Review of Ecology Evolution and Systematics Annual Review of Ecology Evolution and Systematics Vol. 36 519–539 (2005).
Google Scholar
Hortal, J. et al. Ice age climate, evolutionary constraints and diversity patterns of European dung beetles. Ecol. Lett. 14, 741–748 (2011).
PubMed Google Scholar
Kreft, H. & Jetz, W. Global patterns and determinants of vascular plant diversity. Proc. Natl Acad. Sci. USA 104, 5925–5930 (2007).
ADS CAS Google Scholar
Wang, Z. H., Fang, J. Y., Tang, Z. Y. & Lin, X. Patterns, determinants and models of woody plant diversity in China. P. Roy. Soc. B-Biol. Sci. 278, 2122–2132 (2010).
Google Scholar
Kreft, H. & Jetz, W. A framework for delineating biogeographical regions based on species distributions. J. Biogeogr. 37, 2029–2053 (2010).
Google Scholar
Gonzalez-Orozco, C. E., Laffan, S. W., Knerr, N. & Miller, J. T. A biogeographical regionalization of Australian Acacia species. J. Biogeogr. 40, 2156–2166 (2013).
Google Scholar
Wu, Z. Y., Sun, H., Zhou, Z. K., Li, D. Z. & Peng, H. Floristics of seed plants of China. Science press, Beijing. (2010) (In Chinese).
Qian, H. & Ricklefs, R. E. Latitude, tree species diversity and the metabolic theory of ecology. Global Ecol. Biogeogr. 20, 362–365 (2011).
Google Scholar
Yang, W. J., Ma, K. P., Kreft, H. & Daniel Kissling, W. Geographical sampling bias in a large distributional database and its effects on species richness-environment models. J. Biogeogr. 40, 1415–1426 (2013).
Google Scholar
Yang, W. J., Ma, K. P. & Kreft, H. Environmental and socio-economic factors shaping the geography of floristic collections in China. Global Ecol. Biogeogr. 23, 1284–1292 (2014).
Google Scholar
Svenning, J.-C. & Skov, F. Limited filling of the potential range in European tree species. Ecol. Lett. 7, 565–573 (2004).
Google Scholar
Guisan, A. & Thuiller, W. Predicting species distribution: offering more than simple habitat models. Ecol. Lett. 8, 993–1009 (2005).
Google Scholar
Elith, J. et al. A statistical explanation of MaxEnt for ecologists. Divers. Distrib. 17, 43–57 (2011).
Google Scholar
Phillips, S. J., Anderson, R. P. & Schapire, R. E. Maximum entropy modeling of species geographic distributions. Ecol. Model. 190, 231–259 (2006).
Google Scholar
Hernandez, P. A., Graham, C. H., Master, L. L. & Albert, D. L. The effect of sample size and species characteristics on performance of different species distribution modeling methods. Ecography 29, 773–785 (2006).
Google Scholar
Pearson, R. G., Raxworthy, C. J., Nakamura, M. & Townsend Peterson, A. Predicting species distributions from small numbers of occurrence records: a test case using cryptic geckos in Madagascar. J. Biogeogr. 34, 102–117 (2006).
Google Scholar
Raes, N. & ter Steege, H. A null-model for significance testing of presence-only species distribution models. Ecography 30, 727–736 (2007).
Google Scholar
Zhang, M.-G. et al. Using species distribution modeling to improve conservation and land use planning of Yunnan, China. Biol. Conserv. 153, 257–264 (2012).
Google Scholar
Svenning, J.-C. & Skov, F. The relative roles of environment and history as controls of tree species composition and richness in Europe. J. Biogeogr. 32, 1019–1033 (2005).
Google Scholar
Bai, Y. F. et al. Primary production and rain use efficiency across a precipitation gradient on the Mongolia plateau. Ecology 89, 2140–2153 (2008).
PubMed Google Scholar
Gillman, L. N. et al. Latitude, productivity and species richness. Global Ecol. Biogeogr. 24, 107–117 (2015).
Google Scholar
O’Brien, E. M. Climate and woody plant diversity in southern Africa: relationships at species, genus and family levels. J. Biogeogr. 20, 181–198 (1993).
Google Scholar
Hawkins, B. A. et al. Energy, water and broad-scale geographic patterns of species richness. Ecology 84, 3105–3117 (2003).
Google Scholar
Whittaker, R. J., Nogues-Bravo, D. & Araujo, M. B. Geographical gradients of species richness: a test of the water-energy conjecture of Hawkins et al. (2003) using European data for five taxa. Global Ecol. Biogeogr. 16, 76–89 (2007).
Google Scholar
O’Brien, E. M., Whittaker, R. J. & Field, R. Climate and woody plant diversity in southern Africa: relationships at species, genus and family levels. Ecography 21, 495–509 (1998).
Google Scholar
van der Heijden, G. M. F. & Phillips, O. L. Environmental effects on Neotropical liana species richness. J. Biogeogr. 36, 1561–1572 (2009).
Google Scholar
Kreft, H., Jetz, W., Mutke, J. & Barthlott, W. Contrasting environmental and regional effects on global pteridophyte and seed plant diversity. Ecography 33, 408–419 (2010).
Google Scholar
Raes, N., Roos, M. C., Slik, J. W. F., Van Loon, E. E. & Steege, H. t. Botanical richness and endemicity patterns of Borneo derived from species distribution models. Ecography 32, 180–192 (2009).
Google Scholar
Schnitzler, J. et al. Causes of Plant Diversification in the Cape Biodiversity Hotspot of South Africa. Syst. Biol. 60, 343–357 (2011).
PubMed Google Scholar
Cowling, R. M. et al. Variation in plant diversity in mediterranean-climate ecosystems: the role of climatic and topographical stability. J. Biogeogr. 42, 552–564 (2015).
Google Scholar
Ashcroft, M. B. Identifying refugia from climate change. J. Biogeogr. 37, 1407–1413 (2010).
Google Scholar
Stropp, J., Ter Steege, H. & Malhi, Y. Disentangling regional and local tree diversity in the Amazon. Ecography 32, 46–54 (2009).
Google Scholar
ter Steege, H. et al. A spatial model of tree α-diversity and -density for the Amazon. Biodivers. Conserv. 12, 2255–2277 (2003).
Google Scholar
Letten, A. D., Ashcroft, M. B., Keith, D. A., Gollan, J. R. & Ramp, D. The importance of temporal climate variability for spatial patterns in plant diversity. Ecography 36, 1341–1349 (2013).
Google Scholar
Ordonez, A. & Svenning, J.-C. Geographic patterns in functional diversity deficits are linked to glacial-interglacial climate stability and accessibility. Global Ecol. Biogeogr. 24, 826–837 (2015).
Google Scholar
Kristiansen, T. et al. Local and regional palm (Arecaceae) species richness patterns and their cross-scale determinants in the western Amazon. J. Ecol. 99, 1001–1015 (2011).
Google Scholar
Zhu, H. Geographical elements of seed plants suggest the boundary of the tropical zone in China. Palaeogeogr. Palaeocl. 386, 16–22 (2013).
Google Scholar
Xie, Y., Mackinnon, J. & Li, D. Study on biogeographical divisions of China. Biodivers. Conserv. 13, 1391–1417 (2004).
Google Scholar
Tang, Z. Y. et al. Patterns of plant beta-diversity along elevational and latitudinal gradients in mountain forests of China. Ecography 35, 1083–1091 (2012).
Google Scholar
FAO. Terrastat; global land resources GIS models and databases for poverty and food insecurity mapping. -Land and Water Digital Media Series #20 (2002).
Graham, M. H. Confronting multicollinearity in ecological multiple regression. Ecol. Lett. 84, 2809–2815 (2003).
Google Scholar
Wisz, M. S. et al. Effects of sample size on the performance of species distribution models. Divers. Distrib. 14, 763–773 (2008).
Google Scholar
Graham, C. H. et al. The influence of spatial errors in species occurrence data used in distribution models. J. Appl. Ecol. 45, 239–247 (2007).
Google Scholar
Liu, C. R., White, M. & Newell, G. Measuring and comparing the accuracy of species distribution models with presence-absence data. Ecography 34, 232–243 (2011).
CAS Google Scholar
Kleidon, A. & Mooney, H. A. A global distribution of biodiversity inferred from climatic constraints: result from a process-based modelling study. Global Change Biol. 6, 507–523 (2000).
ADS Google Scholar
Tsoar, A., Allouche, O., Steinitz, O., Rotem, D. & Kadmon, R. A comparative evaluation of presence-only methods for modelling species distribution. Divers. Distrib. 13, 397–405 (2007).
Google Scholar
Loiselle, B. A. et al. Predicting species distributions from herbarium collections: does climate bias in collection sampling influence model outcomes? J. Biogeogr. 35, 105–116 (2008).
Google Scholar
Lobo, J. M. & Martin-Piera, F. Searching for a predictive model for species richness of Iberian dung beetle based on spatial and environmental variables. Conserv. Biol. 16, 158–173 (2002).
Google Scholar
Rangel, T. F., Diniz-Filho, J. A. F. & Bini, L. M. SAM: a comprehensive application for Spatial Analysis in Macroecology. Ecography 33, 46–50 (2010).
Google Scholar
Dormann, C. F. et al. Methods to account for spatial autocorrelation in the analysis of species distributional data: a review. Ecography 30, 609–628 (2007).
Google Scholar
Holt, B. G. et al. An Update of Wallace’s Zoogeographic Regions of the World. Science 339, 74–77 (2013).
ADS CAS Google Scholar
McCune, B., Grace, J. B. & Urban, D. L. Analysis of Ecological Communities (MjM Software Design, 2002).
Dufrene, M. & Legendre, P. Species assemblages and indicator species: The need for a flexible asymmetrical approach. Ecol. Monogr. 97, 345–366 (1997).
Google Scholar
Oksanen, J., Kindt, R., Legendre, P. & O’Hara, R. B. vegan: community ecology package (2006).
R: a language and environment for statistical computing (R foundation for Statistical Computing, Vienna, Austria, 2013).
Gonzalez-Orozco, C. E., Thornhill, A. H., Knerr, N., Laffan, S. & Miller, J. T. Biogeographical regions and phytogeography of the eucalypts. Divers. Distrib. 20, 46–58 (2014).
Google Scholar
Laffan, S. W., Lubarsky, E. & Rosauer, D. F. Biodiverse, a tool for the spatial analysis of biological and related diversity. Ecography 33, 643–647 (2010).
Google Scholar

Download references

Acknowledgements

We are grateful to Guo-Ke Chen, Ji-Hong Huang, Bing Liu,Wu-Bing Xu, Jun-Jie Wang for insightful comments on the earlier version of the manuscript. Bin Chen provided considerable assistance in constructing the database. This study is financially supported by CAS STS project (KFJ-EW-STS-020) and Important Specialized Science and Technology Item of Shanxi Province (Grant no. 20121101011).

Author information

Authors and Affiliations

State Key Laboratory of Vegetation and Environmental Change, Institute of Botany, Chinese Academy of Sciences, Beijing, 100093, China
Ming-Gang Zhang & Ke-Ping Ma
Institute of Loess Plateau, Shanxi University, Taiyuan, 030006, China
Ming-Gang Zhang
Faculty of Science, Universiti Brunei Darussalam, Jln Tungku Link, Gadong, BE1410, Brunei Darussalam
J. W. Ferry Slik

Authors

Ming-Gang Zhang
View author publications
You can also search for this author in PubMed Google Scholar
J. W. Ferry Slik
View author publications
You can also search for this author in PubMed Google Scholar
Ke-Ping Ma
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.G.Z., J.W.F.S. and K.P.M. designed the study, conducted the analyses and wrote the paper. K.P.M. wrote the paper and contributed the data. J.W.F.S. discussed and wrote the paper.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Zhang, MG., Slik, J. & Ma, KP. Using species distribution modeling to delineate the botanical richness patterns and phytogeographical regions of China. Sci Rep 6, 22400 (2016). https://doi.org/10.1038/srep22400

Download citation

Received: 22 October 2015
Accepted: 25 January 2016
Published: 01 March 2016
DOI: https://doi.org/10.1038/srep22400

This article is cited by

Mapping the potential distribution suitability of 16 tree species under climate change in northeastern China using Maxent modelling
- Dan Liu
- Xiangdong Lei
- Shouzheng Tang
Journal of Forestry Research (2022)
Identifying climate refugia for 30 Australian rainforest plant species, from the last glacial maximum to 2070
- Sourav Das
- John B. Baumgartner
- Linda J. Beaumont
Landscape Ecology (2019)
Drivers of floristic richness in the Mediterranean: a case study from Tuscany
- Marco D’Antraccoli
- Francesco Roma-Marzio
- Lorenzo Peruzzi
Biodiversity and Conservation (2019)
Major advances in studies of the physical geography and living environment of China during the past 70 years and future prospects
- Fahu Chen
- Bojie Fu
- Xiaochen Liu
Science China Earth Sciences (2019)
Trees represent community composition of other plant life-forms, but not their diversity, abundance or responses to fragmentation
- Bonifacio O. Pasion
- Mareike Roeder
- Kyle W. Tomlinson
Scientific Reports (2018)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.