Siland a R package for estimating the spatial influence of landscape

Carpentier, Florence; Martin, Olivier

doi:10.1038/s41598-021-86900-0

Download PDF

Article
Open access
Published: 05 April 2021

Siland a R package for estimating the spatial influence of landscape

Florence Carpentier^1,2 &
Olivier Martin³

Scientific Reports volume 11, Article number: 7488 (2021) Cite this article

4836 Accesses
4 Citations
7 Altmetric
Metrics details

Subjects

Abstract

The spatial distributions of populations are both influenced by local variables and by characteristics of surrounding landscapes. Understanding how landscape features spatially structure the frequency of a trait in a population, the abundance of a species or the species’ richness remains difficult specially because the spatial scale effects of the landscape variables are unknown. Various methods have been proposed but their results are not easily comparable. Here, we introduce “siland”, a general method for analyzing the effect of landscape features. Based on a sequential procedure of maximum likelihood estimation, it simultaneously estimates the spatial scales and intensities of landscape variable effects. It does not require any information about the scale of effect. It integrates two landscape effects models: one is based on focal sample site (Bsiland, b for buffer) and one is distance weighted using Spatial Influence Function (Fsiland, f for function). We implemented “siland” in the adaptable and user-friendly R eponym package. It performs landscape analysis on georeferenced point observations (described in a Geographic Information System shapefile format) and allows for effects tests, effects maps and models comparison. We illustrated its use on a real dataset by the study of a crop pest (codling moth densities).

A quixotic view of spatial bias in modelling the distribution of species and their diversity

Article Open access 03 May 2023

Landscape Dynamics (landDX) an open-access spatial-temporal database for the Kenya-Tanzania borderlands

Article Open access 18 January 2022

Model systems to elucidate minimum requirements for protected areas networks

Article Open access 20 December 2019

Introduction

Numerous studies demonstrate that the distribution of species richness and abundance depend on both local and landscape variables^1,2,3. However, studying the relationships between landscape and species distributions remains challenging because the shape and the scale of landscape effects are unknown⁴ and can be missed if assessed at an incorrect scale⁵. The studied data usually contains georeferenced observations at point sites, named response variables hereafter, and the description of several landscape spatial variables, named landscape variables hereafter. Their studies are often referred to as focal patch studies⁶. To identify the scale of landscape variable effects, the common approach consists of the following: (i) a priori defining a set of scales; (ii) creating summary variables by computing measures of the landscape variables within discs or rings of radii equal to each scale centered on the observation sites (named buffers hereafter); and (iii) applying a regression model to the response variable with the summary variables as the explanatory variables, for example, a linear model or a random forest algorithm⁷. For each landscape variable, the scale of effect is then considered to be the size of the buffer best explaining the response variable.

The main disadvantage of this method is that the number of explanatory variables artificially increases with the number of spatial scales considered. One then faces a complex statistical dilemma, which is dealing with numerous explanatory variables that by their construction are highly correlated. Consequently, the potential scales chosen are often too few and their ranges are too limited⁸. Finally, the effect of a landscape variable is modelled as uniform within the buffer and as null outside it⁹, which is unrealistic and biologically unjustified as a continuously decreasing effect is expected¹⁰. Several new methods based on distance-weighted effects have been proposed to model a distance-decreasing effect^11,12,13, but they explore a limited predefined set of spatial scales for predictors. Other methods quantified the scale of landscape effects without an a priori choice of spatial scales^9,14. However, none of these methods are yet implemented in a ready-to-use software. Huais¹⁵ proposed a very convenient R function “multifit” to select scales but with some limitation (it is not distance weighted, requires the choice of a set of scales), while calling for further developments of such a method to generalise this type of automated analysis. At present the analysis of landscape effects on population parameters (e.g. abundance or occurrence) therefore remains complex and the results obtained with the methods mentioned above are not easily comparable.

Here we present a general method for landscape effect analysis. We implemented this method in “siland”, a package for the R statistical computing environment. Based on a sequential maximum likelihood estimation procedure, our method improves current methods by providing the joint estimation of the effects of landscape variables as well as the spatial scales of these effects and by allowing comparisons between two spatial effects models. In the first model, Bsiland method hereafter (B for buffer), the effect of a landscape variable is modeled as in classic methods based on focal sample site, i.e. considered as constant over a disc centered on the observation point. But contrarily to the previous method, it does not require a first definition of tested radii since the optimal buffer radii are estimated. In the second model the effect of landscape is based on a weighted distance, as in the framework proposed by Chandler⁹. The decrease in weight with distance is modeled by a Spatial Influence Function (SIF). The parameter of the SIF defined the scale of effect of a landscape variable. In this second method, named Fsiland (F for function), the parameters of the SIFs are estimated for each landscape variable.

The main functions of siland allow the user (i) to estimate the intensity of local and landscape effects and the scale parameter of each landscape variable, (ii) to test these effects and (iii) to plot landscape effects on maps. We exemplified the method and the package use by analysing the landscape effects of conventional and organic orchards on the density of codling moth larvae per apple tree.

Results

Package description

The siland package is written entirely in the scientific computing language R¹⁶. It is available on CRAN (https://cran.r-project.org/package=siland). The analyses presented here were performed using the package siland 2.0.4.

Case study

We illustrated the abilities of siland on an example previously described and analyzed in Ricci¹⁷: the study of codling moth densities, an insect pest specialized on apple orchards in the Basse Durance Valley in southeastern France (see Fig. 1). The datasets can be extracted from the package using the commands data(dataCmoth) and data(landCmoth). The complete analysis script and outputs are available in Supplementary Information file 1 (SI 1). dataCmoth is a data frame with two columns named X and Y containing the observation locations, a column Cmoth, containing the response variable of the study (the mean number of codling moths in the orchards), and a column trait, describing a local variable (the number of insecticide treatments applied in the orchard). landCmoth is a sf object (from package sf¹⁸) describing the landscape variables: conventional tree orchards (conv), organic tree orchards (org) where conv is equal to 1 if the land use is associated to orchard with conventional practice and 0 otherwise, and so is it for org.

Main functions

The main functions are described in Table 1, while a full description of package functions is available at https://cran.r-project.org/web/packages/siland/siland.pdf. The package siland requires two objects containing data. The first object is a data frame composed by two columns named X and Y containing the observation locations, a third column representing the response variable and eventually other columns representing local variables. The second object is a sf object describing landscape variables. It can be obtained directly from shape files of landscape map by using the function st_read() of the package sf (see vignette(“siland") for more details).

Table 1 Main functions of siland package. Detailed information about these functions are given in https://cran.r-project.org/web/packages/siland/siland.pdf.

Full size table

Model estimation is performed using the function Bsiland() for Bsiland method and the function Fsiland() for Fsiland method. The arguments of the both functions are similar: formula of the model, land the sf object describing landscape variables and data, the observation data frame. The syntax of the formula “y ~ model” is similar to lm() function of the stat package. In the model term of the formula, the explanatory variables are added using the symbol “+”. The explanatory variables described in the data frame data are considered as local in the model, those described in the sf object land are considered as landscape variables. Local effects can be modelled as fixed or random (using the syntax (1|x), see vignette(“siland”) for more details). Interaction terms can be considered using the usual symbols “*” or “:”. Notice that only interactions between local x local and local x landscape variables are considered. Various types of response variables can be considered using the argument family which describes the assumed distribution of the response variable and can take the values "gaussian", "poisson" and “binomial” for data of continuous, counting or presence-absence types, respectively (and their associated link functions identity, log and logit respectively).

Using the argument border, the spatial effect of landscape variable can be considered from the observation locations (border = F) or from the border of the polygon where observations are located (border = T) (see Fig. 2). For Fsiland(), the additional argument sif indicates the family of SIF chosen ”exponential” (by default), “gaussian” or “uniform”.

The object returned by the functions Bsiland() and Fsiland() displays the parameters estimation and a test of the global landscape effect. The function summary() applied on the result object provides significance tests of the intensity of the effect of explanatory variables (local or landscape).

The two methods are based on likelihood maximization. The functions Bsiland.lik() and Fsiland.lik() aim to point out some optimization problems. They provide representations of the minus loglikelihood in function of buffer radius or mean distance of the SIF, respectively (see SI Fig. 1). Values of minus loglikelihood lower than the estimated one indicate that the estimation procedure did not perform correctly. In such cases, the estimation needs to be reiterated using different initial values of effects scales (with argument init of functions Fsiland() and Bsiland()).

Graphics outputs

The package siland proposes various functions for graphic representation of estimated landscape effects. The function “plotFsiland.land” displays landscape effect map for the estimations of the Fsiland method (Fig. 3). The influence of each landscape variable is plotted over all the study area. The global influence of the landscape, i.e. the sum of the effects of all landscape variables can also be plotted. The function plotBsiland.land() displays landscape effect map for the estimation of the Bsiland method (Fig. 2). For each landscape variable, the estimated area of effect (buffer of estimated radius) is plotted around each observation locations. The color of the area represents the estimated intensity of the effect i.e. the effect value multiplied by the proportion of the landscape variable in the area of effect.

Interpreting spatial influence functions (SIF)

The SIF function describes how the influence of a pixel/cell of a landscape variable is spatially distributed. We assume that the influence is maximal at the pixel location and decreases with the distance. The greater the estimated mean distance of the SIF, the greater the scale of effect of the landscape variable. The estimated SIF can be displayed using the function plotFsiland.sif() (see Fig. 1). The function Fsiland.quantile() allows to quantify the area of medium influence and significant influence of a landscape variable, that we defined as the disc containing 50% and 95% of the influence of the SIF (neglecting 50% and 5% of its broader effect) respectively. They can be compared to the landscape variables distribution in the study area using function plotFsiland() (see Fig. 1). Note that a Fsiland model with an uniform SIF is similar to Bsiland model. However, in siland package the Bsiland() function calculates the exact percentage of the landscape variable in buffers, when Fsiland() is based on an approximation, i.e. the rasterization of the landscape. As a consequence, the estimation with Fsiland is often faster than with Bsiland but may be erroneous if the definition of the raster is not precise enough.

Conclusion

Estimating scales of effect of landscape variables currently remains a great challenge, and consequently so does the estimation of landscape effect. So far no common methodology has emerged. With the siland package, we propose a tool that we believe is useful for all landscape ecologists who wish to investigate this type of question. Here, we have illustrated how with only few siland functions and limited knowledge of R, it is possible to conduct a reproducible and detailed study of multi-scale landscape effect including estimations but also tests and graphs illustrating the results obtained. The “siland” package is very adaptable, it integrates two methods and can handle various types of data (continuous, binary, discrete..). Applications of siland in a multiyear or multisite framework are already available for the buffers method and presented in the vignette(“siland”). However future developments are still needed to handle the increasing complexity of questions and data (numerous sites, years and dynamic data).

Models and methods

We consider a response variable measured at n different sites denoted Y_i (i stands for a site), L local variables which can be continuous or discrete and are denoted as x_i^l (l stands for a local variable and i for a site) and K landscape variables denoted as z_r^k (k stands for a landscape variable and r for a polygon in the landscape). In the Bsiland method, the effect of landscape variables is modelled using buffers with $p_{{i},\delta_{k}}^{k}$, the percentage of the landscape variable k in a buffer of radius δ_k and centered on site i. Since the Bsiland model is based on the generalized linear models framework, the expected value of the response variable Y_i is modelled as follows:

$$ \mu_{i} = \mu + \sum\limits_{l \in L} {\alpha_{l} x_{i}^{l} } + \sum\limits_{k \in K} {\beta_{k} p_{{i},\delta_{k}}^{k} } $$

(1)

where µ is the intercept, α_l and β_k are the effects of local and landscape variables, respectively.

The Fsiland method is based on Spatial Influence Functions (SIFs) in a similar framework to Chandler & Hepinstall-Cymerman ⁹. To simplify computations, the entire study area is not considered as continuous but rasterized, i.e. pixelated on a regular grid, named R. The value of each landscape variable k at a pixel r is described in z_r^k. For instance, if the landscape variable k is a presence/absence variable, z_r^k is equal to one or zero. The expected value of the response variable Y_i is then modelled as follows:

$$ \mu_{i} = \mu + \sum\limits_{l \in L} {\alpha_{l} x_{i}^{l} } + \sum\limits_{k \in K} {\beta_{k} } \sum\limits_{r \in R} {f_{{\delta}_{k}} (d_{i,r} )z_{r}^{k} } $$

(2)

where f_δk(.) is the SIF associated with the landscape variable k and d_i,r is the distance between the center of pixel r and the observation at site i. The SIF is a density function decreasing with the distance. The scale of effect of a landscape variable k is calibrated through the parameter δ_k, the mean distance of f_δk. Two families of SIF are currently implemented in the siland package, exponential and Gaussian families defined as f_δ(d) = 2/(πδ²)exp(-2d/δ) and f_δ(d) = 1/(2δ)²exp(-π(d/2δ)²), respectively¹⁹. The effect of a landscape variable k is modelled by two parameters: an intensity parameter, β_k describing its strength and its direction and a scale parameter, δ_k, describing how this effect declines with distance. Each pixel potentially has an effect on the response variable at any observation site. No set of scales of effects is initially determined. In Eq. 2, the sum on the regular grid R is an approximation of the integration on the continuous study area. The choice of the grid definition is a tradeoff between computing precision and computing time. The smallest the mesh size of the grid is, the better are the precision but the longer the computing time is (and the larger the required memory size is). The parameters estimation may be very sensitive to this mesh size. To obtain a reliable estimation, we recommend to ensure, after the estimation procedure, that mesh size is at least three times smaller than the smallest estimated SIF (see Supplementary Fig. S2 online for details). If not, it is recommended to proceed with a new estimation with a smaller mesh (by using the wd argument of the Fsiland function, set at 30 by default).

All parameters, µ, {α₁,…, α_K}_, {β₁,…, β_K} but also {δ₁,…, δ_K} are simultaneously estimated by likelihood maximization for both Bsiland and Fsiland methods. We have developed a sequential algorithm. At the initialization stage, values are arbitrarily defined for the {δ_1,..,δ_K} scales parameters. In step A, the µ, {α₁,.., α_K}_, {β₁,.., β_K} parameters are estimated using the classical maximization procedures implemented in the lm and glm functions knowing the fixed values of the scale parameters. In step B, the scale parameters are estimated by likelihood maximization knowing the parameters estimated in step A. The values of the scale parameters are then fixed at the new estimated values. Steps A and B are thus repeated until the relative increase in likelihood decreased below a threshold or the maximum number of repetitions is reached. Tests performed (obtained using the summary function) are similar to those given by summary.lm or summary.glm function (see R Core Team¹⁶ for details, this implies that tests are given conditionally to the estimated scale parameters.).

References

García, D., Zamora, R. & Amico, G. C. The spatial scale of plant–animal interactions: effects of resource availability and habitat structure. Ecol. Monogr. 81(1), 103–121. https://doi.org/10.1890/10-0470.1 (2011).
Article Google Scholar
Remm, J., Hanski, I. K., Tuominen, S. & Selonen, V. Multilevel landscape utilization of the Siberian flying squirrel: scale effects on species habitat use. Ecol. Evol. 7(20), 8303–8315. https://doi.org/10.1002/ece3.3359 (2017).
Article PubMed PubMed Central Google Scholar
Rusch, A. et al. Agricultural landscape simplification reduces natural pest control: a quantitative synthesis. Agr. Ecosyst. Environ. 221, 198–204. https://doi.org/10.1016/j.agee.2016.01.039 (2016).
Article Google Scholar
Miguet, P., Jackson, H. B., Jackson, N. D., Martin, A. E. & Fahrig, L. What determines the spatial extent of landscape effects on species?. Landsc. Ecol. 31(6), 1177–1194. https://doi.org/10.1007/s10980-015-0314-1 (2016).
Article Google Scholar
Smith, A. C., Fahrig, L. & Francis, C. M. Landscape size affects the relative importance of habitat amount, habitat fragmentation, and matrix quality on forest birds. Ecography 34(1), 103–113. https://doi.org/10.1111/j.1600-0587.2010.06201.x (2011).
Article Google Scholar
Thornton, D. H., Branch, L. C. & Sunquist, M. E. The influence of landscape, patch, and within-patch factors on species presence and abundance: a review of focal patch studies. Landsc. Ecol. 26(1), 7–18. https://doi.org/10.1007/s10980-010-9549-z (2011).
Article Google Scholar
Bradter, U., Kunin, W. E., Altringham, J. D., Thom, T. J. & Benton, T. G. Identifying appropriate spatial scales of predictors in species distribution models with the random forest algorithm. Methods Ecol. Evol. 4(2), 167–174. https://doi.org/10.1111/j.2041-210x.2012.00253.x (2012).
Article Google Scholar
Jackson, H. B. & Fahrig, L. Are ecologists conducting research at the optimal scale?. Glob. Ecol. Biogeogr. 24(1), 52–63. https://doi.org/10.1111/geb.12233 (2015).
Article Google Scholar
Chandler, R. & Hepinstall-Cymerman, J. Estimating the spatial scales of landscape effects on abundance. Landsc. Ecol. 31(6), 1383–1394. https://doi.org/10.1007/s10980-016-0380-z (2016).
Article Google Scholar
Moilanen, A. & Hanski, I. On the use of connectivity measures in spatial ecology. Oikos 95(1), 147–151. https://doi.org/10.1034/j.1600-0706.2001.950116.x (2001).
Article Google Scholar
Aue, B., Klemens, E., Stefan, H. & Volkmar, W. Distance weighting avoids erroneous scale effects in species-habitat models. Methods Ecol. Evol. 3(1), 102–111. https://doi.org/10.1111/j.2041-210X.2011.00130.x (2011).
Article Google Scholar
Henry, M. et al. Spatial autocorrelation in honeybee foraging activity reveals optimal focus scale for predicting agro-environmental scheme efficiency. Ecol. Model. 225, 103–114. https://doi.org/10.1016/j.ecolmodel.2011.11.015 (2012).
Article Google Scholar
Serckx, A. et al. Bonobo nest site selection and the importance of predictor scales in primate ecology. Am. J. Primatol. 78(12), 1326–1343. https://doi.org/10.1002/ajp.22585 (2016).
Article PubMed Google Scholar
Walsh, C. J. & Webb, J. A. Spatial weighting of land use and temporal weighting of antecedent discharge improves prediction of stream condition. Landsc. Ecol. 29(7), 1171–1185. https://doi.org/10.1007/s10980-014-0050-y (2014).
Article Google Scholar
Huais, P. Y. multifit: an R function for multi-scale analysis in landscape ecology. Landsc. Ecol. 33(7), 1023–1028. https://doi.org/10.1007/s10980-018-0657-5 (2018).
Article Google Scholar
R Core Team. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing. Retrieved from https://www.R-project.org. (2019)
Ricci, B. et al. The influence of landscape on insect pest dynamics: a case study in southeastern France. Landsc. Ecol. 24(3), 337–349. https://doi.org/10.1007/s10980-008-9308-6 (2009).
Article Google Scholar
Pebesma, E. Simple features for R: standardized support for spatial vector data. The R Journal 10(1), 439–446. https://doi.org/10.32614/RJ-2018-009 (2018).
Article Google Scholar
Austerlitz, F. et al. Using genetic markers to estimate the pollen dispersal curve. Mol. Ecol. 13(4), 937–954. https://doi.org/10.1111/j.1365-294X.2004.02100.x (2004).
Article PubMed Google Scholar

Download references

Acknowledgements

We are grateful to Claire Lavigne and Pierre Franck for their case study, which illustrates the use of the siland package, and for their useful advice. We are thankful to Etienne Klein for his relevant comments. This work was financially supported by the Metaprogram SMACH (Sustainable Management of Crop Health; http://www.smach.inra.fr/) of the France's National Research Institute for Agriculture, Food and Environment (INRAE).

Author information

Authors and Affiliations

Université Paris-Saclay, INRAE, UR MaIAGE, 78350, Jouy-en-Josas, France
Florence Carpentier
AgroParisTech, 75005, Paris, France
Florence Carpentier
INRAE, BioSP, 84914, Avignon, France
Olivier Martin

Authors

Florence Carpentier
View author publications
You can also search for this author in PubMed Google Scholar
Olivier Martin
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

F.C. and O.M. conceived the ideas and designed methodology, programmed the R package. All authors contributed critically and equally to the drafts and gave final approval for publication.

Corresponding author

Correspondence to Florence Carpentier.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information 1.

Supplementary Information 2.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Carpentier, F., Martin, O. Siland a R package for estimating the spatial influence of landscape. Sci Rep 11, 7488 (2021). https://doi.org/10.1038/s41598-021-86900-0

Download citation

Received: 15 October 2020
Accepted: 18 March 2021
Published: 05 April 2021
DOI: https://doi.org/10.1038/s41598-021-86900-0

This article is cited by

‘Scalescape’: an R package for estimating distance-weighted landscape effects on an environmental response
- Erin B. Lowe
- Ben Iuliano
- Anthony R. Ives
Landscape Ecology (2022)
Pollination efficiency in farmland landscapes: exploring the relative roles of spillover, dilution and complementarity between habitats
- Maxime Ragué
- Vincent Bretagnolle
- Sabrina Gaba
Landscape Ecology (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.