Revealing patterns of local species richness along environmental gradients with a novel network tool

Baudena, Mara; Sánchez, Angel; Georg, Co-Pierre; Ruiz-Benito, Paloma; Rodríguez, Miguel Á.; Zavala, Miguel A.; Rietkerk, Max

doi:10.1038/srep11561

Download PDF

Article
Open access
Published: 25 June 2015

Revealing patterns of local species richness along environmental gradients with a novel network tool

Mara Baudena^1,2,
Angel Sánchez^2,3,
Co-Pierre Georg^4,2,
Paloma Ruiz-Benito^5,6,
Miguel Á. Rodríguez⁶,
Miguel A. Zavala⁶ &
…
Max Rietkerk¹

Scientific Reports volume 5, Article number: 11561 (2015) Cite this article

6641 Accesses
11 Citations
4 Altmetric
Metrics details

Subjects

Abstract

How species richness relates to environmental gradients at large extents is commonly investigated aggregating local site data to coarser grains. However, such relationships often change with the grain of analysis, potentially hiding the local signal. Here we show that a novel network technique, the “method of reflections”, could unveil the relationships between species richness and climate without such drawbacks. We introduced a new index related to potential species richness, which revealed large scale patterns by including at the local community level information about species distribution throughout the dataset (i.e., the network). The method effectively removed noise, identifying how far site richness was from potential. When applying it to study woody species richness patterns in Spain, we observed that annual precipitation and mean annual temperature explained large parts of the variance of the newly defined species richness, highlighting that, at the local scale, communities in drier and warmer areas were potentially the species richest. Our method went far beyond what geographical upscaling of the data could unfold and the insights obtained strongly suggested that it is a powerful instrument to detect key factors underlying species richness patterns and that it could have numerous applications in ecology and other fields.

Integrating multiple sources of ecological data to unveil macroscale species abundance

Article Open access 03 April 2020

Mapping density, diversity and species-richness of the Amazon tree flora

Article Open access 08 November 2023

Biased-corrected richness estimates for the Amazonian tree flora

Article Open access 23 June 2020

Introduction

At large spatial extent, species richness is known to co-vary with climate, typically increasing from the poles towards the tropics^1,2,3,4. Climate determines water availability and ambient energy levels, although the mechanisms connecting these factors to species richness and diversity are still debated^1,5,6,7. Usually plant richness patterns correlate strongly and positively with water availability and e.g. tree richness patterns have been found to be strongly associated with rainfall in both the tropics² and the extratropics^5,8, while ambient energy becomes limiting in cold climates¹.

Analyses of species richness-climate relationships at large geographical extents often use data at coarse grains (e.g. square cells of 10 × 10 km or larger), obtained from scaling up local site data that are supposed to be too noisy to reveal climatic and environmental signals (e.g. refs. 4,9). However, various broad-scale, grid-based studies show that different grains can lead e.g. to changes in the steepness (or even in the direction) of the relationship between species richness and the explanatory climatic variables (see e.g. refs. 10, 11, 12, 13, 14, 15, 16, 17, 18). Similarly, scaling up local site data to larger cells is likely to wipe out local information, related e.g. to environmental variation, or to the community structure (i.e. changing the relative ubiquity of species and introducing species co-occurrences that do not exist at the community level). This in turn would prevent detecting how variations in local species richness relate to climate over broad spatial extents.

Here we study the patterns of species richness at large extent using data at small, local, site scale, with the help of network analysis, which allows to include information from the whole network at the local level. Networks in ecology have been used mainly to describe food-webs or plant-pollinator interactions (see e.g. refs. 19, 20, 21, 22). Some works also used network approaches within a trophic level (e.g. refs. 23, 24, 25, 26). In food-webs and pollinator-plant webs, the species interactions directly define their networks. When representing species within a trophic level, the network links instead often represent co-occurrence, as built out of a classical presence/absence matrix, or in some cases a matrix of species abundances. This kind of matrices have been largely investigated in ecology, e.g. to explain geographical species patterns empirically (e.g. ref. 27), to identify species associations (e.g. ref. 28) or to study theoretically the relationships between patterns of species richness and range sizes²⁹. Network techniques are powerful also when applied within a trophic level, because they account for all the species co-occurrence throughout a certain area in a synthetic way, which allows for global description and insights, without losing the local information.

In the present work, we introduce into ecology a new network technique, called the “method of reflections”, which has been used in economics to show that a few factors drive the product export of countries³⁰. We adapt this method to analyse the distribution and richness of species, obtaining a technique that summarizes information on community structure and species distribution throughout the entire dataset and that uses this information to get an indication of potential species richness at the local level. The method of reflections introduces a new way of removing noise from local site species richness, identifying to which extent species richness of each site is further away (or closer to) its potential.

To demonstrate the added value of this method, we consider the case of woody plants (i.e. trees and shrubs) in mainland Spain, using data from the Spanish Forest Inventory (SFI), which describes forests at local, site scale (i.e. circular plots of ≤25 m radius). Woody plant species richness in this area has been associated to major climate characteristics, namely mean annual temperature and annual precipitation, in contrasting ways, depending whether the studies use local site data³¹, or large cells³². While no study, to our knowledge, has found a clear pattern for local species richness in Spanish forests using a dataset as extensive as the SFI, Terradas et al.³¹ analyse a small subset of community level data sampled in canonical (i.e. well-preserved) forest stands (thus with minimum heterogeneity due to management practices). They detect an increase of species richness with mean annual temperature and a decrease with annual precipitation. Although their study sites are located in NE Spain, this result can be expected to be roughly indicative of the situation across peninsular Spain, since the study covered a strong climatic gradient, encompassing a range of woodland types representative of major Mediterranean and Atlantic woodland formations in Spain. Conversely, the grid-based analysis by Vetaas & Ferrer-Castán³² for the Iberian Peninsula finds opposite trends using cells with a resolution of 50 km, i.e., coarse grain: woody species richness increasing with annual precipitation and decreasing with mean annual temperature. It is not clear why opposite results are obtained for species richness at the local and coarse-grain scales, but it cannot be excluded they are related to the above mentioned sensitivity of upscaling (grid-based) techniques to underlying heterogeneity or grain size (e.g. refs. 13,33).

The objective of this work is twofold. Firstly, we aim at demonstrating the effectiveness of the method of reflections in removing noise and isolating the signal of potential species richness, calculated at the site level from an ecological presence/absence dataset. Secondly, we apply the method of reflections to investigate how the variations of woody plant species richness across local communities in Spain relate to major climate characteristics. For a comparison, we also explore these relationships with the classical upscaling analysis. As we will show, after removing the noise in our data with the method of reflections, we effectively detect climate relationships with local woody species richness, specifically an increase with mean annual temperature and a decrease with annual precipitation, thus displaying similar relationships to those observed in canonical sites (i.e. well-preserved local communities³¹) and in contrast with the opposite patterns that emerge from the upscaling analysis (as in ref. 32).

In the following two sections, we introduce the method of reflections and we illustrate its validity and potential with an example of its application to synthetic datasets. We then apply the method to analyse the SFI and discuss these results and the method relevance and wide applicability. Finally, we report the descriptions of the SFI dataset and of the analyses.

The method of reflections

Here we introduce the “method of reflections”³⁰, which we employ to analyse species richness. We briefly mention also how the method applies to the species frequency of occurrence, which is equivalent to species range size for those species whose range distribution is limited to our study area. We consider the species and the sites as separate parts of a so-called “bipartite” network³⁴, in which plants are connected to the sites where they grow.

For the species used in the analysis, we introduce the species presence/absence matrix M, whose elements M_ps are equal to 1 if a plant species p is present in a site s and 0 otherwise. The matrix M represents a bipartite network with two kinds of nodes, species and sites, where each species is connected to the sites where it is observed. The method of reflections introduces a family of variables (hereafter referred to as “reflections”) that characterize the species and the sites, producing a symmetric set of measures. We start from the definition of two widely used ecological quantities, species frequency of occurrence and site species richness:

where the zeroth order reflection k_p,0 is the number of sites where the plant species p is observed (i.e. its frequency of occurrence) and the zeroth order reflection k_s,0 represents the number of species per site s (i.e. site species richness). These reflections correspond to the so-called “node degree” of species and sites, respectively (e.g. ref. 35).

The method of reflections introduces a generalization of these two quantities, defining N reflections for each type of node, i.e. for both the species and the sites. The reflections of sites (k_p,N) and species (k_s,N) are calculated iteratively as the average value of the previous level properties:

for N > 0, thus obtaining a set of reflections (k_p,0, k_p,1, k_p,2, … k_p,N) for the plant species and a set (k_s,0, k_s,1, k_s,2, … k_s,N) for the sites. For example, the first-order species reflection, k_p,1 is defined as:

and it is thus the average species richness of the sites where a plant species p was observed. The first site reflection, k_s,1 is defined as:

i.e. the average frequency of occurrence of the species growing in a site s. These first order reflections have been occasionally used in previous studies^29,36.

The method of reflections continues the iteration process to higher orders and thus the following iteration for the sites introduces k_s,2 as the weighted average species richness of a site s, where the species assume different weights according to their k_p,1, i.e. to the species richness of the sites where they were observed (see Fig. 1). Species living on average in species-richer sites (i.e. species that are more “social”) have larger weights. In other words, k_s,2 is an index related to site species richness, which includes information on the species richness of all the sites where the species in site s are observed. The process is analogous for the species reflection k_p,2. Iterating further, as shown above for the sequence k_s,0 – k_p,1 - k_s,2 , the information travels between the even reflections of one type of node and the odd reflections of the other type³⁷ (see an illustration in Fig. 1). Thus, we obtain two families of reflections for each type of nodes. For sites, even reflections (with order larger than zero, i.e. k_s,2, k_s,4,…) represent generalized measures of species richness and are thus related to the site zeroth order reflection k_s,0, while odd reflections (i.e. k_s,1, k_s,3, k_s,5,…) are generalized average measures of the frequencies of occurrence of the species present at the site and thus are related to k_p,0 (the zeroth order reflection of the species). For each plant species, even reflections (with order larger than zero, i.e. k_p,2, k_p,4,…) represent generalized measures of its frequency of occurrence, whereas odd reflections (i.e. k_p,1, k_p,3, k_p,5,…) are related to the species richness of the sites where the species was observed (Fig. 1; see also Table 1 for a summary of the reflection meaning).

Table 1 Explanation of the different variables (“reflections”) obtained with the method of reflections, including node type used, reflection symbol, parity, order and name

Full size table

As an example, we consider high order, even site reflections (e.g. k_s,18), which represent generalized site species richness. Let us imagine a site that has many species because it is at an early successional stage as a consequence of a “disturbance”, but most of its species tend to live in monospecific stands, or with few other species, in the rest of the dataset. The generalized species richness of this site would then have a relatively low value, because most of its species get low weights. Analogously, if a site is species poor, but its few species generally co-exist with many other species in the rest of the dataset, the site generalized species richness would be high, because the “social” species have large weights. Thus, generalised species richness is an indication of the potential species richness of a site, keeping into account the general properties of the species that compose its local community, as estimated from the whole network (i.e. the dataset). In the following section we illustrate this mathematical property with an example.

Finally, we notice here that when stating that the reflections are “generalized” measures, we imply that the reflections smooth out the variables they represent throughout similar nodes (i.e. sites or plant species) in the network. In other words, the method isolates the signal from the noise. It is important to note that the generalized measures are not new estimations of site species richness and species frequency of occurrence, but they rather are new, smoothed indices that contain averaged information about those variables. The iterative procedure used to calculate them stabilizes the averaging effect and in fact, the reflections tend to converge to similar values when N is large³⁷. While their values are not informative per se (and as such the reflections can be considered as indices), yet they contain a lot of information in their tiny deviations³⁰, as a consequence of the inclusion at node level of information from the whole network, which leads to the noise removal.

An illustrative application to synthetic datasets

Here we illustrate with an example that the method of reflections can identify the signal of (potential) species richness even when it is hidden behind the randomness of a noisy dataset. We generated two synthetic datasets where the species are stochastically distributed, each according to a known probability of occurrence as a function of the environmental conditions in the sites. In the first dataset (D1), the number of species that could potentially occur in a site changed along the environmental range. In the second dataset (D2), the number of species that could potentially occur did not change along the climatic gradient. We used this dataset to check that the method did not find spurious correlations.

For both datasets, we generated large matrices of species presence/absence in sites (110 species, 10000 sites). To keep the example simple, the environment in each site was described by only one variable, namely annual precipitation, which was randomly assigned to the site. The annual precipitation varied between 0 and 1500 mm y⁻¹ and to obtain a uniform distribution of the values across the sites, it was sampled every 25 mm y⁻¹. The probability of occurrence of each species varied with the annual precipitation and the distributions were normal, truncated between 0 and 1500 mm y⁻¹ (see Supplementary Note, Fig. S1). In the dataset D1, we assumed that several species had identical probabilities of occurrence and the number of species that preferred one end of the precipitation gradient was higher than the number of species that preferred the other end of the gradient. In detail, the number of species that had the maximum probability at a certain precipitation value decreased linearly with precipitation itself, with 20 species having a high probability of occurrence in the driest areas (with mean of the probability distribution μ = 125 mm y⁻¹ and standard deviation σ = 100 mm y⁻¹), 10 species in the wettest (μ = 1375 mm y⁻¹ and σ = 100 mm y⁻¹) and with the number of species decreasing linearly in between these two extremes. We included also twenty “generalist” species that had a broad probability distribution all over the precipitation range (μ = 750 mm y⁻¹ and σ = 500 mm y⁻¹, see Supplementary Note, Fig. S1). In the second dataset (D2), we did not include any trend in the number of species that preferred a certain precipitation. We generated 110 species with mean value of the probability of occurrence chosen at random between 0 and 1500 mm y⁻¹ (sampled every 100 mm y⁻¹ to sample uniformly) and standard deviation of 900 mm y⁻¹. For both datasets, the species were then assigned randomly to the different sites, with a probability that was calculated as a function of the site annual precipitation. For each site and species we generated a random number between 0 and 1 and we assigned the species to the site only if the random number obtained was smaller than the probability of occurrence of the species in that site.

We started from dataset D1 and plotted the species richness data as a function of the annual precipitation of the sites (Fig. 2a). The randomness in the process of assigning the species to the sites hid the linear trend in the species that could potentially live along the precipitation gradient. The best model fit (obtained with Generalised Linear Models – GLM – and selected as described in detail below) was a quadratic model, with species richness decreasing with annual precipitation as expected, but explaining very little variance (R² = 0.06, Fig. 2a and Supplementary Note, Table S1). We then applied the method of reflections up to the order N = 18 and we plotted the generalised species richness k_s,18 as a function of the site annual precipitation (Fig. 2c). The GLM analysis showed that the best fit supported a linear decrease of generalised species richness with annual precipitation, explaining more than 80% of the variance (R² = 0.84, Fig. 2c and Supplementary Note, Table S1). To verify that the method did not introduce spurious correlations, we performed two different tests. Firstly, we bootstrapped the dataset D1, maintaining the same matrix of species presence/absence but shuffling around the site precipitation values randomly. We repeated the shuffling n = 1000 times, obtaining n datasets to which we could apply the method of reflections. No correlation was observed between the generalised species richness and the annual precipitation in these n datasets (average R² ~ 0). Secondly, we applied the method of reflections to the dataset D2. As expected, no model was supported for the fit of annual precipitation and species richness (Fig 2b), but most importantly, also no model fit was supported for generalised species richness k_s,18 and precipitation (Fig 2d). We therefore can state that the method of reflections removes the noise in a large dataset of species presence/absence, identifying the potential species richness signal using only the species composition of the site and their assembly pattern throughout the dataset. With the last two examples, we also verified that the method does not introduce artefactual patterns but it simply highlights hidden signals.

Finally, with the synthetic dataset D1, we illustrated two other important features of the method (see Supplementary Note). Firstly, we showed that the generalised species richness (k_s,18) was a measure of the potential richness of the site, obtained including information about the species patterns of co-occurrence throughout the dataset at site level. This proof was possible for this dataset, because we could estimate the potential richness from the theoretical probability of occurrence of each species in each site (see Supplementary Note, Fig. S2) and it confirmed our explanation of the method as derived from the mathematical formulas (Eqs. 3, 4 and Table 1). Secondly, we could show that the method could identify how far (or close) the sites were from their potential richness, using the changes in site ranking from generalised species richness k_s,18 to species richness k_s,0 (see Supplementary Note, Fig. S3).

Patterns of species richness in the Spanish forests

In this section, we illustrate the results obtained applying the method of reflections to study the patterns of woody species richness along a climate gradient in Spain, using the Spanish Forest Inventory dataset (see below for a description).

We started exploring the relationships between species richness at site scale and two climatic variables separately (namely, annual precipitation and mean annual temperature). We first analysed the data at site scale (as available from the SFI) and then we processed the site data to generate four new grid-based databases, each consisting of equal sized square cells (5 × 5 km, 10 × 10 km, 25 × 25 km and 50 × 50 km, respectively) for which we computed annual precipitation, mean annual temperature and the corresponding species richness (i.e. the number of woody plant species present in the sites each cell contains). In mainland Spain, both site and up-scaled species richness values of woody plants varied largely with annual precipitation and mean annual temperature, following highly heteroscedastic patterns with maximums around 700 mm y⁻¹ and 13 °C respectively (Fig. 3). At the site scale, the best GLM of species richness (selected as described below) decreased linearly with precipitation and (quadratically) increased with temperature, but the model explained variance (i.e. the R²) was very low. At the upscaled grains of analysis, the R² of species richness were quite low, with temperature explaining a quite small fraction of the variance at all grain sizes and with precipitation that tended to explain an increasing fraction of the variance when moving from fine to coarse grains (see Fig. 3 and Supplementary Table S2). Models reflected generally consistent trends across grains, i.e. a cubic relationship of richness with precipitation (with an underlying increasing trend) and predominantly a negative linear relationship with temperature. Thus, the models at different grain sizes essentially captured the same climate signals for species richness.

Subsequently, we applied the method of reflections and calculated site and plant species reflections, up to the order N = 18 (as in ref. 30). Since the reflections are indices and their values are not comparable at different order, to relate them we ranked the sites according to each of their even reflections (from k_s,0 to k_s,18), following ref. 30 and we analysed the patterns of change in ranking. The first aim of this analysis was to observe whether or not the site ranking was changing with increasing order of the reflection. As expected³⁰, in the SFI data the rankings tended to converge after a few iterations (e.g. N ~ 0–12, see Fig. 4), which also confirmed that the maximum order calculated (N = 18) was high enough. The second aim of the rank analyses was to identify which sites were furthest (or closest) to their potential species richness (as explained above in the previous section and in the Supplementary Note). We could clearly notice (in Fig. 4) that some sites were represented by lines ending up approximately at the same height on the right axes from which they started on the left axis. These sites displayed little change in their ranking (i.e. less than 1% rank change from k_s,0 to k_s,18) and we considered them as “typical” sites, meaning that the behaviour of their species was similar throughout the network and their species richness was close to the potential species richness, as expected from their species composition. Other sites instead changed a lot in their ranking position, either from very low species richness (bottom left corner, blue lines) to very high generalized species richness (top right corner), or vice versa from high species richness (red lines) to low generalized values (from top left towards bottom right). We defined as “anomalous” those sites whose rank position changed more than 80%: the method corrected their species richness, identifying it as anomalous with respect to what potentially could be expected given their species composition. To illustrate this, we choose two examples of common tree species that occur largely in the “anomalous” sites (Fagus sylvatica and Pinus halepensis). Fagus sylvatica tends to live with few other woody species. In our dataset, it occurred with less than 5 species in 67% of its sites. No sites including F. sylvatica increased their rank, while 42% of the sites where this species co-occurs with more than 10 other species decreased in ranking position. Since F. sylvatica behaves as a superior competitor in the study region³⁸, it tended to occur in species poor sites and thus its contribution to calculating generalized species richness was low. Therefore, the sites where F. sylvatica was found with many other species decreased in ranking. Another, opposite example is Pinus halepensis, which tends to co-occur with many shrub species, mainly in calcareous sites (see e.g. ref. 39). No sites including P. halepensis decreased their ranking position, while 80% of the sites where this species occurred on its own, or with one other species, increased their ranking. The method identified these low richness sites as “anomalous” and “potentially” rich, as indicated by their ranking according to generalized species richness, which was higher than according to observed species richness. In general, we observed that the sites that did not change in ranking (i.e. the “typical” sites) were evenly distributed geographically, while those that decreased in ranking clustered in Northern Spain (a wetter and cooler region with Atlantic macroclimate) and those that increased rank were mostly at the boundary between the semi-arid and arid Mediterranean areas (see the map in Supplementary Figure S4).

Finally, we explored the relationship between generalized site species richness (k_s,18) and the climate site variables. Linear models (selected following the procedure described in the last subsection of this paper) were the best fit for both site annual precipitation and mean annual temperature (see Supplementary Table S3). Generalised site species richness decreased strongly with increasing site mean annual precipitation (R² = 0.42, Fig. 5a), while it increased with site mean annual temperature, although with a less good fit (R² = 0.25, see Fig. 5b). In other words, generalized site species richness (k_s,18) was higher in arid (dry and warm) sites (i.e. under characteristic Mediterranean macroclimate conditions). We then focused only on the “typical” sites (i.e. those with less than 1% change in ranking from the lowest to the highest reflection) and we analysed the relationship of their site species richness (k_s,0) with the climate variables. We expected that woody species richness of these sites would follow the same pattern of the generalized species richness of all the sites, because the “typical” sites should not change substantially in their behaviour going from low to high reflection order. We found that the species richness (k_s,0) of the “typical” sites displayed the same general trend as the generalized species richness of all the sites. For the selected sites, species richness decreased as a function of annual precipitation (the log-linear model was the best fit, with R² = 0.43, see Fig. 5c) and increased with mean annual temperature for most of the range (the best model being a cubic relationship, with R² = 0.45, see Fig. 5d). We obtained similar results if we defined the “typical” sites using different tolerance intervals (e.g. within 5% or 10% change in ranking, not shown). For further details on model selection, see Supplementary Table S3.

The noise removal was also apparent when comparing the map of site species richness (Fig. 6a), with those of generalized species richness (k_s,18, Fig. 6b) and of species richness (k_s,0) of the “typical” sites (Fig. 6c). While the two latter maps show patterns analogous to those of precipitation and temperature (Fig. 6d,e), the site species richness map (Fig. 6a) follows the climate patterns less clearly when inspected by eye. For example, in the centre-west region of Spain, sites display a wide range of species richness, with many low values (light colours, Fig. 6a), while the high temperature (dark colours, Fig. 6d) and low precipitation (light colours, Fig. 6e) are quite uniform throughout the area.

Discussion

In this work we introduced the method of reflections, a new network technique that successfully isolated the site species richness signal, incorporating at the local level information about the distribution of species from the whole dataset. The method introduced the generalised species richness: a new index that is related to potential species richness. We applied this method to study how woody plant species patterns in mainland Spain relate to precipitation and temperature gradients, finding out that large part of the variation of site (potential) species richness could be explained with annual precipitation (and, to a lesser extent, mean annual temperature). At the local site scale, the method of reflections showed that woody communities in drier and warmer areas tended to have higher richness of woody plant species, and, remarkably, we observed the same signal in the generalized species richness of all the sites and in the (non-processed) species richness of a sub-set of sites that we identified as “typical”. Interestingly, these trends largely coincided with those found by Terradas et al.³¹ in their analysis of woody species richness variation across canonical, well-preserved woodland communities in NE Spain (for more comparable results, see also ref. 40). In contrast, when analysing the data using the classical geographical upscaling, the trends revealed were different (i.e., a third-order polynomial relationship of richness with annual precipitation, with an overall increasing trend and a predominantly negative relationship with mean annual temperature), similarly to what Vetaas & Ferrer-Castán³² reported in their grid-based analysis of woody species richness patterns in the Iberian Peninsula.

The new index we introduced here, the “generalised species richness”, was closely related to site potential species richness. The index combined information in a novel manner, taking species distribution and the inter-species structure into account. The new definition was based on within-network similarities (i.e. community assembly patterns), including information from sites that had similar species composition, irrespective of their geographical locations. The method actively reduced noise and isolated the signal, including information from the whole dataset at the local (site) scale and introducing a sort-of weighted averaging technique, in a way reminiscent of e.g. Principal Component Analyses, but with the advantage of not using information from the explanatory variables. Comparing all the communities with similar taxonomic compositions, the method calculated the generalized species richness of a certain site as expected given its current species composition. The method recognized the sites with uncommon richness as outliers with respect to their species composition, reducing the noise they create and allowing small-scale data to gain greater generality. In this process, the method recognized whether each species in a certain site lived with an anomalous number of other species. However, it did not recognize whether or not these species are always the same throughout the dataset. It is worth remarking that the sites (and their communities) were identified as “anomalous” only because they do not represent the most typical situations, without any implication about their ecological relevance.

As in the case of the product exports of countries³⁰, the method of reflections proved to be able to determine key relationships underlying the network structure also for the woody plant distribution in mainland Spain. Here, the method showed the climate fingerprint in the species distribution very clearly and confirmed our expectation (derived from the findings of Terradas and coauthors³¹, see above) that community level woody species richness tends (potentially) to be higher in the more arid Mediterranean communities. While site species richness in the dataset exhibited only negligible relationships with precipitation and temperature, these climate variables could fit much better the generalized species richness (with very good fit performance, especially for precipitation).

The method identified some sites as anomalous in woody Spanish forests, possibly due, for example, to the existence of extraordinary environmental conditions (e.g. atypical soil characteristics), or to their history of anthropogenic disturbances, which could have lead the local community to be species richer or poorer than potentially expected. Checking the method performance with a few specific examples, we observed that the sites displaying more species than expected often corresponded to sites with F. sylvatica that (for reason such as soil heterogeneity, ungulate grazing, etc.) appear richer than common F. sylvatica forests, typically poor in woody plants because of the superior competitive abilities of this species³⁸. On the contrary, the “anomalous” sites corresponded at large to Mediterranean forests, most of them with P. halepensis, at the boundary between the semi-arid and arid areas of Southeast Iberia, or influenced by the Ebro depression. In these sites, the common trend of P. halepensis dominated forests, i.e. exhibiting a rich cohort of shrub species (e.g. ref. 39), was not present, suggesting that unaccounted environmental or management factors (e.g. fire, grazing or planted origin) had prevented these communities to reach their potential species composition. This was not connected only to these two species and to their communities, which we chose as illustrative examples. In general, there was a tendency for the “anomalous” sites occurring in the Atlantic region of northern Spain to have more species than expected, while the “anomalous” sites of the Mediterranean region exhibited fewer species than expected. In a way, we can say that “noise” and “disturbances” act in the direction of levelling off the species richness differences along the gradient in Spanish forests.

These examples, together with the application to synthetic datasets, illustrate that the method identifies the sites that deviate from most common or “potential” assemblages. These deviations could be due to different underlying processes, such as historical legacies (e.g. land use history or forest management), environmental heterogeneity (e.g. slope aspect, edaphic conditions), trophic interactions (e.g. herbivory, pathogens) or community dynamics (e.g. successional stage). Thus, the method could be used to identify hotspots of species richness (identifying those sites that are richer than potentially expected), or areas where management could be particularly effective in enhancing species richness (i.e. sites where species richness is currently lower than expected). This information could be usefully applied in community or restoration ecology (see also ref. 41), as it allows to infer an indication of potential plant richness for a given site not on the basis of climatic preferences (as in classical gradient analyses), but rather of environmental and/or endogenous mechanisms driving community organization. This kind of application could be previously validated using datasets with different historical records.

The method of reflections is an alternative to classical methods to look at species richness patterns. For example, geographical upscaling aggregates local data at coarser grains, according to their geographical proximity. This is an effective way to smooth out local environmental heterogeneities and it renders species richness data comparable to larger scale climate variables. However, many studies have found not only that the predictive power of environmental predictors often changes with grain size (e.g. ref. 42,43), but also that particular richness-environment relationships may even show different sign depending on the size of the analysis units used (e.g. refs. 13,33). As pointed out by Rahbek & Graves⁴⁴, the use of coarse grains introduce an averaging effect that obscures the fine structure of species richness gradients and localized richness peaks, thus decreasing our possibilities to discriminate the causal agents underlying richness patterns.

Although aggregating local data at coarse grain is in a way similar to what the method of reflections does (i.e. removing noise), our method identifies the local species richness signal and brings out information about potential species richness, integrating information from the species community composition in the whole dataset, instead of using geographical proximity and calculating species richness at coarser grain. For this reason, in our case study, the method of reflections could identify strong relationships of (generalized) species richness with precipitation and temperature (negative and positive, respectively), indicating that local woody communities in dryer and warmer areas across Spain, corresponding to the Mediterranean ecosystems, can potentially have higher woody plant species richness.

The method of reflections showed a very high performance in catching the main factors of variability underlying the dataset analysed and allowing an effective integration of ecological variability to obtain indications of potential species richness. Since presence/absence datasets are very common in ecological data collection, the method could be applied to analyse other geographical areas, other trophic levels or species groups at various scales, possibly leading to very interesting ecological insights on the environmental drivers of species distributions and richness, due to a method that elaborates species richness information on the basis of co-occurrence patterns and community structure rather than on geographical or environmental proximity.

Methods

In the following two subsections, we describe respectively the datasets we used for the analyses of the Spanish woody species and the use of GLM and the model selection procedures. All the analyses in this work were performed using MATLAB R2013b (The MathWorks, Inc.).

The Spanish study area and the datasets

In this paper we analysed data from an extensive dataset describing woody species of continental Spain. This region (492,173 km²) comprises large altitudinal (max. 3500 m.a.s.l.) and climatic gradients (from Atlantic to Mediterranean climate) and consequently displays a large diversity of habitats and species. We used the third Spanish Forest Inventory (SFI), which was surveyed between 1997 and 2007, distributing a 1-km² cell grid over Spain⁴⁵. The data used to compute the presence-absence in each site and thus to build the matrix for the network analysis, were obtained differently for trees and shrubs from the SFI dataset⁴⁵. For trees, the set-up at each site involved four concentric circular sub-plots of 5, 10, 15 and 25 m radius each, which measured individuals with d.b.h. (diameter at breast height) larger than 7.5 cm, 12.5 cm, 22.5 cm and 42.5 cm, respectively in each of the subplot. For shrubs, species presence-absence was recorded at the 10 m radius subplot.

We selected a total of 45,620 sites of the SFI according to the following criteria: (i) being located over mainland Spain; (ii) having at least one adult tree; (iii) showing no evidence of thinning or harvesting; and (iv) not being identified as planted. We obtained the planted character from ref. 46, where the map of Spanish Provenance Regions of Forest Species was joined spatially with the SFI sites, using a definition of planted forests as not originated by natural regeneration from local or nearby native sources^47,48. We included exotic species, because they might influence the distribution of native plants, but excluded all species present in less than 10 sites, obtaining a total of 211 species for the analysis (see Supplementary Table S4). Note that, because we focused on forests with canopy dominant trees, woody species that tend to occur only in open habitats but not in early successional forests might be under-represented.

We considered mean annual temperature and annual precipitation to represent the major climatic conditions of each site in mainland Spain, with climatic variables obtained at 1 × 1 km scale (series 1951–1999, ref. 49). We selected these two climatic variables as representative of the climatic conditions in each site following previous studies^38,46, which showed (using Principal Component Analyses) that these two variables contain the largest environmental variation among an initial set of potentially correlated topographic and climatic variables of the Iberian Peninsula. These two variables are weakly correlated (r² = 0.12), thus minimizing multicollinearity.

GLMs and model selection

To explore the relationships between variables in this work (e.g. species richness and climatic variables), we took the following steps (see also e.g. ref. 32). We used polynomial Generalized Linear Models (GLMs), with: i- log link and a Poisson error distribution for count data (such as species richness); and ii- identity link and a normal error distribution for continuous data (such as generalised species richness). We tested functional forms including from linear up to cubic terms of the explanatory variables. For each model, we calculated the pseudo-R² (as in ref. 50) (which for brevity we call R² in this paper) and the corrected Akaike Information Criterion (AICc⁵¹). For each grain size and each climatic variable, we selected the best model with the following criteria, which tried to maximize parsimony and model fit at the same time, keeping the model complexity as low as possible⁵². Starting from the linear model, we would select a model with higher order (quadratic or cubic) only if: i- R² increased at least of 0.01 when increasing order and ii- the AICc index of the more complex model was at least two unit lower⁵³. We chose not to select any model if none of them had R² larger than 0.01.

Additional Information

How to cite this article: Baudena, M. et al. Revealing patterns of local species richness along environmental gradients with a novel network tool. Sci. Rep. 5, 11561; doi: 10.1038/srep11561 (2015).

References

Hawkins, B. A. et al. Energy, water and broad-scale geographic patterns od species richness. Ecology 84, 3105–3117 (2003).
Article Google Scholar
O’Brien, E. Climatic gradients in woody plant species richness : towards an explanation based on an analysis of southern Africa’s woody flora. J. Biogeogr. 20, 181–198 (1993).
Article Google Scholar
Clarke, A. & Gaston, K. J. Climate, energy and diversity. Proc. R. Soc. B Biol. Sci. 273, 2257–2266 (2006).
Article Google Scholar
Field, R. et al. Spatial species-richness gradients across scales: a meta-analysis. J. Biogeogr. 36, 132–147 (2009).
Article Google Scholar
Hawkins, B. A., Montoya, D., Rodríguez, M. Á., Olalla-Tárraga, M. Á. & Zavala, M. Á. Global Models for Predicting Woody Plant Richness from Climate: Comment. Ecology 88, 255–259 (2007).
Article Google Scholar
Allen, A., Brown, J. & Gillooly, J. Global Biodiversity, Biochemical Kinetics and the Energetic-Equivalence Rule. Science 297, 1545–1548 (2002).
Article CAS ADS Google Scholar
O’Brien, E. M. Biological relativity to water-energy dynamics. J. Biogeogr. 33, 1868–1888 (2006).
Article Google Scholar
Field, R., O ’Brien, E. M. & Lavers, C. P. Global Models for Predicting Woody Plant Richness from Climate : Reply. Ecology 88, 259–262 (2007).
Article Google Scholar
Wright, D. H., Currie, D. J. & Maurer, B. A. in Species Divers. Ecol. communities Hist. Geogr. Perspect. ( Ricklefs, R. E. & Schluter, D. ) 66–74 (Chicago University Press, 1993).
Wang, J. et al. Impact of deforestation in the Amazon basin on cloud climatology. Proc. Natl. Acad. Sci. 106, 3670–3674 (2009).
Article CAS ADS Google Scholar
Hurlbert, A. H. & Jetz, W. Species richness, hotspots and the scale dependence of range maps in ecology and conservation. Proc. Natl. Acad. Sci. 104, 13384–13389 (2007).
Article CAS ADS Google Scholar
Nogués-Bravo, D., Araújo, M. B., Romdal, T. & Rahbek, C. Scale effects and human impact on the elevational species richness gradients. Nature 453, 216–219 (2008).
Article ADS Google Scholar
Hillebrand, H. On the Generality of the Latitudinal Diversity Gradient. Am. Nat. 163, 192–211 (2004).
Article Google Scholar
Hartley, S., Kunin, W. E., Lennon, J. J. & Pocock, M. J. O. Coherence and discontinuity in the scaling of species’ distribution patterns. Proc. R. Soc. B Biol. Sci. 271, 81–88 (2004).
Article Google Scholar
Hartley, S. & Kunin, W. Scale dependency of rarity, extinction risk and conservation priority. Conserv. Biol. 17, 1559–1570 (2003).
Article Google Scholar
Giladi, I., Ziv, Y., May, F. & Jeltsch, F. Scale-dependent determinants of plant species richness in a semi-arid fragmented agro-ecosystem. J. Veg. Sci. 22, 983–996 (2011).
Article Google Scholar
Wilson, R. J., Thomas, C. D., Fox, R., Roy, D. B. & Kunin, W. E. Spatial patterns in species distributions reveal biodiversity change. Nature 432, 393–396 (2004).
Article CAS ADS Google Scholar
Stiles, A. & Scheiner, S. M. A multi-scale analysis of fragmentation effects on remnant plant species richness in Phoenix, Arizona. J. Biogeogr. 37, 1721–1729 (2010).
Article Google Scholar
Montoya, J. M., Pimm, S. L. & Solé, R. V. Ecological networks and their fragility. Nature 442, 259–264 (2006).
Article CAS ADS Google Scholar
Bascompte, J. Disentangling the web of life. Science 325, 416–419 (2009).
Article CAS ADS MathSciNet Google Scholar
Bastolla, U. et al. The architecture of mutualistic networks minimizes competition and increases biodiversity. Nature 458, 1018–1020 (2009).
Article CAS ADS Google Scholar
Saavedra, S., Stouffer, D. B., Uzzi, B. & Bascompte, J. Strong contributors to network persistence are the most vulnerable to extinction. Nature 478, 233–235 (2011).
Article CAS ADS Google Scholar
Verdú, M. & Valiente-Banuet, A. The nested assembly of plant facilitation networks prevents species extinctions. Am. Nat. 172, 751–760 (2008).
Article Google Scholar
Azaele, S., Muneepeerakul, R., Rinaldo, A. & Rodriguez-Iturbe, I. Inferring plant ecosystem organization from species occurrences. J. Theor. Biol. 262, 323–329 (2010).
Article CAS MathSciNet Google Scholar
Araújo, M. B., Rozenfeld, A., Rahbek, C. & Marquet, P. A. Using species co-occurrence networks to assess the impacts of climate change. Ecography (Cop.). 34, 897–908 (2011).
Article Google Scholar
Saiz, H. & Alados, C. L. Structure and spatial self-organization of semi-arid communities through plant – plant co-occurrence networks. Ecol. Complex. 8, 184–191 (2011).
Article Google Scholar
Cottenie, K. Integrating environmental and spatial processes in ecological community dynamics. Ecol. Lett. 8, 1175–1182 (2005).
Article Google Scholar
Gotelli, N. J. Null Model Analysis of Species Co-Occurrence Patterns. Ecology 81, 2606–2621 (2000).
Article Google Scholar
Arita, H. T., Christen, J. A., Rodríguez, P. & Soberón, J. Species diversity and distribution in presence-absence matrices: mathematical relationships and biological implications. Am. Nat. 172, 519–532 (2008).
Article Google Scholar
Hidalgo, C. A. & Hausmann, R. The building blocks of economic complexity. Proc. Natl. Acad. Sci. 106, 10570–10575 (2009).
Article CAS ADS Google Scholar
Terradas, J., Salvador, R., Vayreda, J. & Lloret, F. Maximal species richness: an empirical approach for evaluating woody plant forest biodiversity. For. Ecol. Manage. 189, 241–249 (2004).
Article Google Scholar
Vetaas, O. R. & Ferrer-Castán, D. Patterns of woody plant species richness in the Iberian Peninsula: environmental range and spatial scale. J. Biogeogr. 35, 1863–1878 (2008).
Article Google Scholar
Pecher, C., Fritz, S. A., Marini, L., Fontaneto, D. & Pautasso, M. Scale-dependence of the correlation between human population and the species richness of stream macro-invertebrates. Basic Appl. Ecol. 11, 272–280 (2010).
Article Google Scholar
Newman, M. E. J. The structure and function of complex networks. SIAM Rev. 45, 167–256 (2003).
Article ADS MathSciNet Google Scholar
Thébault, E. Identifying compartments in presence-absence matrices and bipartite networks: insights into modularity measures. J. Biogeogr. n/a–n/a (2012) 10.1111/jbi.12015.
Vázquez, D. P., Aizen, M. a., Ecology, S., May, N. & Vazquez, D. P. Asymmetric Specialization: A Pervasive Feature of Plant-Pollinator Interactions. Ecology 85, 1251–1257 (2004).
Article Google Scholar
Tacchella, A., Cristelli, M., Caldarelli, G., Gabrielli, A. & Pietronero, L. A new metrics for countries’ fitness and products’ complexity. Sci. Rep. 2, 723 (2012).
Article ADS Google Scholar
Gómez-Aparicio, L., García-Valdés, R., Ruíz-Benito, P. & Zavala, M. A. Disentangling the relative importance of climate, size and competition on tree growth in Iberian forests: implications for forest management under global change. Glob. Chang. Biol. 17, 2400–2414 (2011).
Article ADS Google Scholar
Costa, M., Morla, C. & Sáinz, H. Los bosques ibéricos: una interpretación geobotánica. (Editorial Planeta, Barcelona, 1997).
Estevan, H., Lloret, F., Vayreda, J. & Terradas, J. Determinants of woody species richness in Scot pine and beech forests: climate, forest patch size and forest structure. Acta Oecologica 31, 325–331 (2007).
Article ADS Google Scholar
Montoya, D., Rogers, L. & Memmott, J. Emerging perspectives in the restoration of biodiversity-based ecosystem services. Trends Ecol. Evol. 27, 666–672 (2012).
Article Google Scholar
Rahbek, C. & Graves, G. R. Multiscale assessment of patterns of avian species richness. Proc. Natl. Acad. Sci. 98, 84534–84539 (2001).
Article ADS Google Scholar
Belmaker, J. & Jetz, W. Cross-scale variation in species richness–environment associations. Glob. Ecol. Biogeogr. 20, (2011).
Rahbek, C. & Graves, G. R. Detection of macro-ecological patterns in South American hummingbirds is affected by spatial scale. Proc. R. Soc. B Biol. Sci. 267, 2259–2265 (2000).
Article CAS Google Scholar
Villanueva, J. A. (editor). Tercer Inventario Forestal Nacional: 1997-2007. Comunidad de Madrid. (Ministerio de Medio Ambiente, Madrid, Spain, 2004).
Ruiz-Benito, P., Gómez-Aparicio, L. & Zavala, M. A. Large-scale assessment of regeneration and diversity in Mediterranean planted pine forests along ecological gradients. Divers. Distrib. 1–15 (2012) 10.1111/j.1472-4642.2012.00901.x.
Alía, R., Alba, N., Agundez, D. & Iglesias, S. Manual para la comercialización y producción de semillas y plantas forestales: materiales de base y de reproducción. (DGB, Ministerio de Medio Ambiente, Madrid, 2005).
Alía, R. et al. Regiones de procedencia de especies forestales en España. (DGB, Ministerio de Medio Ambiente, Medio Rural y Marino, Madrid, 2009).
Gonzalo, J. Diagnosis fitoclimática de la España peninsular. Actualización y análisis geoestadístico aplicado. (2008).
McFadden, D. in Front. Econom. ( Zarembka, P. ) 105–142 (Academic Press, 1974).
Akaike, H. A New Look at the Statistical Model Identification. IEEE Trans. Automat. Contr. 19, 716–723 (1974).
Article ADS MathSciNet Google Scholar
Johnson, J. B. & Omland, K. S. Model selection in ecology and evolution. Trends Ecol. Evol. 19, 101–108 (2004).
Article Google Scholar
Burnham, K. & Anderson, D. Model selection and multimodel inference: a practical information-theoretic approach (Springer-Verlag, 2002).

Download references

Acknowledgements

This research was supported by the ERA-Net on Complexity through the project RESINEE (“Resilience and interaction of networks in ecology and economics”, EP/I019170/1). M.A.Z. and P.R.B. were supported by REMEDINAL2 (CAM, S2009/AMB-1783). M.A.R. acknowledges supports by the MINECO (grants CGL2010-22119 and CGL2013-48768-P). We thank the MAGRAMA for granting access to the Spanish Forest Inventory data.

Author information

Authors and Affiliations

Copernicus Institute of Sustainable Development, Environmental Sciences Group, Utrecht University, P.O. Box 80115, TC Utrecht, 3508, The Netherlands
Mara Baudena & Max Rietkerk
Departamento de Matemáticas, Grupo Interdisciplinar de Sistemas Complejos (GISC), Universidad Carlos III de Madrid, Avenida de la Universidad 30, Leganés, 28911, Spain
Mara Baudena, Angel Sánchez & Co-Pierre Georg
Instituto de Biocomputación y Física de Sistemas Complejos (BIFI), Universidad de Zaragoza, Zaragoza, 50018, Spain
Angel Sánchez
School of Economics and African Institute of Financial Markets and Risk Management, University of Cape Town, Private Bag X1, Rondebosch (Cape Town), 7700, South Africa
Co-Pierre Georg
Biological and Environmental Sciences, School of Natural Sciences, University of Stirling, Stirling, FK9 4LA, United Kingdom
Paloma Ruiz-Benito
Department of Life Sciences, Forest Ecology and Restoration Group, University of Alcalá, Edificio de Ciencias, Campus Universitario, Alcalá de Henares (Madrid), 28805, Spain
Paloma Ruiz-Benito, Miguel Á. Rodríguez & Miguel A. Zavala

Authors

Mara Baudena
View author publications
You can also search for this author in PubMed Google Scholar
Angel Sánchez
View author publications
You can also search for this author in PubMed Google Scholar
Co-Pierre Georg
View author publications
You can also search for this author in PubMed Google Scholar
Paloma Ruiz-Benito
View author publications
You can also search for this author in PubMed Google Scholar
Miguel Á. Rodríguez
View author publications
You can also search for this author in PubMed Google Scholar
Miguel A. Zavala
View author publications
You can also search for this author in PubMed Google Scholar
Max Rietkerk
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.S., C.G., M.A.Z., M.B. and M.R. jointly conceived the project. A.S., C.G., M.B. and M.R. translated the method from economy to ecology. P.R.B. extracted the data. M.B. performed the data analysis (with contributions from M.A.R. and A.S.). M.A.R. also provided feedbacks on the biogeographical interpretation of the results. M.B. wrote the first draft of the manuscript and all authors contributed substantially to revisions.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Baudena, M., Sánchez, A., Georg, CP. et al. Revealing patterns of local species richness along environmental gradients with a novel network tool. Sci Rep 5, 11561 (2015). https://doi.org/10.1038/srep11561

Download citation

Received: 12 March 2015
Accepted: 26 May 2015
Published: 25 June 2015
DOI: https://doi.org/10.1038/srep11561

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.