An Adaptive Weighting Algorithm for Interpolating the Soil Potassium Content

Liu, Wei; Du, Peijun; Zhao, Zhuowen; Zhang, Lianpeng

doi:10.1038/srep23889

Download PDF

Article
Open access
Published: 07 April 2016

An Adaptive Weighting Algorithm for Interpolating the Soil Potassium Content

Wei Liu^1,2,
Peijun Du¹,
Zhuowen Zhao² &
…
Lianpeng Zhang²

Scientific Reports volume 6, Article number: 23889 (2016) Cite this article

2014 Accesses
12 Citations
Metrics details

Subjects

Abstract

The concept of spatial interpolation is important in the soil sciences. However, the use of a single global interpolation model is often limited by certain conditions (e.g., terrain complexity), which leads to distorted interpolation results. Here we present a method of adaptive weighting combined environmental variables for soil properties interpolation (AW-SP) to improve accuracy. Using various environmental variables, AW-SP was used to interpolate soil potassium content in Qinghai Lake Basin. To evaluate AW-SP performance, we compared it with that of inverse distance weighting (IDW), ordinary kriging, and OK combined with different environmental variables. The experimental results showed that the methods combined with environmental variables did not always improve prediction accuracy even if there was a strong correlation between the soil properties and environmental variables. However, compared with IDW, OK, and OK combined with different environmental variables, AW-SP is more stable and has lower mean absolute and root mean square errors. Furthermore, the AW-SP maps provided improved details of soil potassium content and provided clearer boundaries to its spatial distribution. In conclusion, AW-SP can not only reduce prediction errors, it also accounts for the distribution and contributions of environmental variables, making the spatial interpolation of soil potassium content more reasonable.

Global Soil Hydraulic Properties dataset based on legacy site observations and robust parameterization

Article Open access 25 July 2022

The SMOS-Derived Soil Water EXtent and equivalent layer thickness facilitate determination of soil water resources

Article Open access 27 October 2020

African soil properties and nutrients mapped at 30 m spatial resolution using two-scale ensemble machine learning

Article Open access 17 March 2021

Introduction

The continuous spatial distribution of the soil plays a significant role in the fields of agriculture and environmental management^1,2. Scientists and agricultural managers often require continuous data of soil properties over a region of interest for making informed decisions and justified interpretations. However, such data are usually not readily available and often difficult and expensive to acquire, especially for high-altitude mountainous regions. Moreover, soil property data are usually collected by point sampling. Thus, attribute values at unsampled points require estimation for the generation of spatially continuous data of soil properties. Therefore, spatial interpolation techniques are essential for predicting the continuous data of soil properties for unsampled locations using data from limited point observations.

Existing spatial interpolation methods can be largely classified into three groups³: 1) deterministic or non-geostatistical methods (e.g., inverse distance weighting [IDW]); 2) stochastic or geostatistical methods (e.g., universal kriging [UK]); and 3) combined methods (e.g., ordinary kriging [OK] with environmental variables [OK-Geo]). Kriging is a geostatistical interpolation method that provides the best linear unbiased estimation for the interpolation result, but demands the higher data stability, such as meeting a second-order stationary assumption. Non-geostatistical interpolation methods such as IDW interpolation, which assume that each sampling point on the local impact with the increase in distance gradually disappears, do not have the statistical advantage despite its simple operation. However, such methods are often data- or even variable-specific, and their performance depends on many factors⁴. No consistent findings have shown how these factors affect spatial prediction method performance. Some researchers found that kriging method outperformed IDW and other methods because of a careful choice of the optimal number of neighboring points as well as the variogram model and its parameters^{3,5,6,7,8,9,10,11}; however, the others showed that kriging was not better than the other methods^{4,12,13,14,15}. Therefore, it is often a challenge to select an appropriate spatial interpolation method for a given dataset.

Machine learning methods have been applied to the fields of data mining and spatial interpolation and have demonstrated their predictive accuracy, e.g., artificial neural networks (ANN), support vector machine (SVM), ensemble learning(EL), and random forest (RF)^10,16,17,18. Furthermore, ANN and SVM have been applied to daily minimum air temperature and rainfall data in the studies by Gilardi¹⁹ and Rigol et al.²⁰. EL and RF were previously applied to the spatial interpolation of environmental variables by Liu²¹ and Li²². However, this is a kind of global interpolation model, a simple iteration of which cannot explain the spatial instability of soil properties.

AW-SP, a machine learning paradigm in which multiple learners are trained to solve the same problem, originated from Hansen and Salamon’s work²³, which showed that the generalization ability of a model system can be significantly improved through use of a number of models, i.e. training many models and then combining their predictions. Since this technology performs remarkably well, it has become a very hot topic in machine learning communities²⁴. In contrast to ordinary interpolation approaches that try to use one interpolation model based on training data, AW-SP try to construct a set of interpolation models and combine them. As in the axiom ‘many hands make light work’, the predictive ability of the ensemble is usually significantly better than that of a single model²⁴.

The spatial distribution of soil is also greatly affected by environmental features including land use type, soil type, geological type, and slope, etc.^22,25. Soil property data may vary significantly within a short horizontal distance because of the different soil environment, it is difficult to accurately interpolate soil property distributions in the absence of obvious environment features. Therefore, spatial interpolators combining environment features in pedometrics and digital soil have been studied by increasingly more researchers^{5,25,26,27,28,29,30,31,32}. We used the secondary variables in an effort to improve interpolation accuracy.

In this study, we aim to address the following questions: 1) can ensemble of these existing spatial interpolation methods with environmental variables improve the predictive accuracy? 2) can describe spatial distribution pattern of soil potassium content more accurately based on difference environmental variables? To address the two questions, we applied the ensemble learning methods and a number of existing spatial interpolation methods including IDW, OK and the combined methods (e.g., OK combined with environmental variables) to soil potassium content we collected from around the Qinghai Lake in September, 2013.We examined the effects of the performance of AW-SP, IDW, OK and its combined methods (Table 1). Finally, the prediction patterns of the interpolation methods were analyzed based on their prediction maps.

Table 1 Methods compared for predicting soil potassium content in this study.

Full size table

Results

Comparison of interpolation performance

To assess the accuracy of AW-SP for interpolating soil potassium content, we compared the performances of AW-SP, IDW, OK, OK-LU, OK-Soil, OK-Grassland, OK-Geology, and OK-Geo. We calculated the mean absolute error (MAE), root mean square error (RMSE), and mean error (ME) as measures of interpolation quality by comparing the predicted and measured values (Table 2). We found that interpolation combined with environmental variables, including AW-SP, OK-LU, OK-Soil, OK-Grassland, and OK-Geo (i.e., excluding OK-Geology) obviously improved interpolation precision. AW-SP in particular was the most accurate method. OK and IDW had similar performance. AW-SP could achieve a slightly better MAE (0.1003) than those of OK-Geo (0.1284), OK-LU (0.1376), OK-Soil (0.1381), OK-Grassland (0.1387), OK (0.1485), IDW (0.1487), and OK-Geology (0.1521). Similarly, the RMSE for AW-SP (0.1374) was the smallest, followed by OK-Geo (0.1732), OK-LU (0.1741), OK-Soil (0.1754), OK-Grassland (0.1797), OK (0.1838), IDW (0.1872), and OK-Geology (0.2022). The ME of AW-SP (0.0000) was much smaller than those of OK-Geo (0.0011), OK-LU (0.0017), OK-Soil (0.0015), OK-Grassland (0.0012), OK (0.0026), OK-Geology (0.0026), and IDW (−0.0030).

Table 2 Comparisons of the accuracy among IDW, OK, OK-LU, OK-Soil, OK-Grassland, OK-Geology, OK-Geo and AW-SP.

Full size table

Effects of the exclusion of slope and geology

The prediction errors of the two combined with all environmental variables methods (i.e., AW-SP and OK-Geo) are reduced after the exclusion of slope and geology information in terms of RMAE, RRMSE and ME, although p-values change with the methods and with predictive error measurements (Table 3). Overall, the methods without slope and geology information are relatively more accurate than those with slope and geology information.

Table 3 Effects of slope and geology exclusion on the prediction error of AW-SP and OK-Geo.

Full size table

Comparison of the interpolated maps

Applied 8 methods to interpolate the soil potassium content (i.e., AW-SP, OK-Geo, OK-LU, OK-Soil, OK-Grassland, OK-Geology, OK and IDW) are illustrated in Fig. 1(a–h). The spatial patterns of AW-SP and OK-Geo captured similar major spatial patterns and trends of soil potassium content but had evident smooth surface patterns and could not more accurately describe the local variation in OK-Geo, for which the simulated results ranges were somewhat narrower in the predictions. The OK produced a map similar to those of OK-Geo and AW-SP but with the smoothing effect and weak “bull’s eye” patterns. The predictions of IDW displayed similar major patterns but failed to predict the changes in local variation and displayed strong “bull’s eye” patterns at high and low sample points. The predictions of OK-LU, OK-Soil, OK-Grassland, and OK-Geology combined with environmental variables eliminated the smoothing effect of OK and had significant variation in the different abrupt boundary types. For example, in the soil potassium content interpolation, combining land use information, OK-Landuse gave more details of soil potassium content distribution in different land use types, especially in the abrupt boundary. In the opposite, soil potassium content values of OK and IDW interpolation map did not have the discrete information.

Therefore, the interpolators that considered the environmental variables can more accurately describe the local variation. These results showed that combining appropriate environmental variables (excluding the geology types) as a secondary variable could significantly improve the local variation interpolation performance.

Discussion

The performance of ensemble learning for spatial interpolation

Kriging usually outperforms IDW and is generally superior, at least in theory³. However, in this study, kriging performed similarly to IDW (e.g., OK) or less well (e.g., OK-Geology). A similar finding was also reported by Collins and Bolstad¹², who found that optimal IDW was superior to kriging when the data were isotropic and when the primary variable was not correlated with the secondary variable. In this study, even though the correlations between the primary variable and secondary variables were strong, suggesting a strong spatial trend, kringing (e.g., OK-Geology) was not always superior to IDW, at least for soil potassium content.

In contrast to ordinary interpolation methods (e.g., OK) that attempt to generate one learner from the training data, the ensemble method tries to construct a set of base learners and then to combine them. The interpolation accuracy of an ensemble is usually much better than that of a single interpolation model, which makes ensemble methods very attractive. The superior performance of AW-SP in this study could be attributed to the following factors associated with the methods.

The training set of soil potassium content might not provide sufficient information for selection of the single best interpolation model. For example, there might be many interpolation models that perform equally well with the given training set. Thus, combining these interpolation models (e.g., OK-LU, OK-Soil, and OK-Grassland) might be a better choice.

The training set of soil potassium content being interpolated might not contain the true spatial pattern, while the ensembles can provide a good approximation. For example, it is known that for the same piece of rainfed cropland, the soil potassium content for chernozem soil type will be quite different to that of a sandy soil region. Therefore, if land use type is only considered as secondary variables, the use of a single OK-LU method will not lead to a good interpolation, whereas a better approximation could be achieved by combining a set of interpolation methods (e.g., OK-LU, OK-Soil, and OK-Grassland).

The predictions of AW-SP are more reasonable for the extrapolation of soil potassium content in this study and more accurate than OK, IDW, and OK-Geo.

The effectiveness of secondary variables for reducing predictive error

The distribution of soil properties is controlled by several environmental variables, such as land use, soil type, and slope³³. The soil property distribution could vary significantly within small spatial scales because of different soil environment types, which can make it difficult to obtain accurate interpolations using AW-SP when such obvious secondary variables are ignored. Therefore, environmental variables should be combined with AW-SP to improve interpolation efficiency.

Different types of land use, soil, geology, slope, and vegetation cover all have an effect on land surfaces. Several studies have indicated that environmental variables are significantly related to the spatial distribution of soil properties^2,34,35. Hu et al.³⁵ and Shi et al.² have shown that the spatial distribution pattern of soil properties has strong correlation with different environmental variables^2,35.

A comparison between the accuracy of those methods that use secondary information (i.e., AW-SP, OK-Geo, OK-LU, OK-Soil, and OK-Grassland except OK-Slope and OK-Geology) and that of the methods that do not use secondary information (i.e., OK and IDW) shows that the use of secondary information improves the accuracy of the interpolation. This finding is consistent with previous studies that have shown that stronger correlations result in more accurate predictions when using ordinary cokriging³⁶ and ordinary cokriging over OK³⁷. It was also reported that a threshold exists because for a correlation >0.4 simple cokriging and ordinary cokriging performed better than simple kriging and OK, for the stronger correlations (r >0.75), the methods those uses the information available on the secondary variable are more superior to OK³⁸. However, the OK-Geology and OK-Slope methods are two exceptions, despite the fact that the correlation between soil potassium content and geology type was found to be strong in this study (Table 4). This suggests that the inclusion of secondary information does not always improve the prediction accuracy, which does not support the argument regarding the role of secondary variables in spatial interpolation^2,5,25. In this study, this could probably be attributed to the low density of sampling of soil potassium content.

Table 4 ANOVA for testing the effects of secondary variables on variances of soil potassium.

Full size table

Limitations

The limitation of AW-SP is that it has a smoothing effect and that the surface variation is smaller than the ensemble object values. If the ensemble results for an object are smaller than the measured value, the AW-SP results will be lower than the measured value. The current research method used ‘tandem’ ensemble interpolation models, incorporating a global interpolation model for the entire study area, although a simple global model cannot explain the spatial instability of soil properties. In future, we will use ‘parallel’ ensemble interpolation models, based on the different regional characteristics of the study area and with consideration of the problems of simulation scale, to select the appropriate interpolation model integration.

Methods

Section 4.1 describes the AW-SP process. The AW-SP is constructed in two steps. First, a number of base learners are produced (e.g., OK-LU, OK-Soil, and OK-Grassland). Then, the base learners are combined using a popular combination scheme. Section 4.2 shows the interpolation parameter specifications. Section 4.3 shows how to select the secondary variables. Section 4.4 shows how to assess interpolation performance.

AW-SP

Here we constructed the interpolation model (i.e., OK-LU, OK-Soil, OK-Grassland, and OK-Geology) as the base learner of the AW-SP. As a kind of geostatistical model^31,38, each observation Z(x_m,n, y_m,n) of a specific soil potassium content at location (x, y) in the n-th type of the m-th kind of environmental feature can be expressed as equation (1).

where M(E_m,n) is the mean value of Z(x_m,n, y_m,n) in the n-th type of the m-th kind of environmental feature, and R(x_m,n, y_m,n) is the residual computed by subtracting the mean value M(E_m,n) of the n-th type of the relative m-th environmental feature from the measured value of soil potassium content. We assume that M(E_m,n) and R(x_m,n, y_m,n) are mutually independent and that the variation of R(x_m,n, y_m,n) is homogeneous over the entire study area.

The residuals of the relevant types of environment features are then used to interpolate the surface of the residuals in the entire study area by OK. The interpolated values of residuals are finally summed to the mean values of soil potassium content as one base learner of AW-SP. The construction of base learner framework is shown in Fig. 2 and the steps are described in the below.

(1) Based on the ANOVA analysis, we analyzed the environmental features that affected the spatial distribution pattern of soil potassium content most significantly and chose the environmental features (m) which was most related to soil potassium content.
(2) Based on the measured values of soil potassium content, we calculated the mean and residuals of the soil potassium content values for each type (n) of environmental feature (m).
(3) According to the spatial distribution pattern of the mean values of soil potassium content related to the environmental features (m), we converted the environmental features (m) to 30 m resolution grids according to the mean values of each type (n) using the modules of the Conversion Tools of ArcGIS 10.1, and mapped the overall distribution pattern of soil potassium content M(E_m,n) for each type (n).
(4) Based on the related residuals, we used OK to simulate the remaining residual surface of soil potassium content R(x_m,n, y_m,n).
(5) We added the mean surface and residual surface to obtain the m-th environmental feature related to the interpolation surface (e.g., OK-LU) as the ensemble learning of the base learners.

Second, the ensemble learning framework algorithm (see Supplementary Methods) was used to integrate all of the interpolation models (e.g., OK-LU, OK-Soil, and OK-Grassland) as the AW-SP simulation result. The steps are described below.

(1) The ensemble learning algorithm assigned equal weights to all the sampling points of the training soil potassium content data, and the distribution of the weights was denoted at the t-th learning round as D_t.
(2) From the training dataset and D_t, the ensemble learning framework algorithm chooses a base learner h_t (e.g., OK-LU) by calling the base learning algorithm.
(3) Then, it used verification points to test h_t (see Supplementary Methods the equation 1) and increased the weights of incorrectly interpolated points. Thus, an updated weight distribution D_t+1 was obtained (see Supplementary Methods the equation 3).
(4) From the training dataset and D_t+1, the algorithm used another base learner (e.g., OK-Soil) by calling the base learning algorithm again.
(5) Such a process was repeated T times (T depends on the number of base learner) and the final learner derived by weighted averaging of the T base learners, where the weights of the learners were determined during the training process (see Supplementary Methods the equation 2).

Parameter specification

The parameter specifications were based on the requirements of the interpolation methods and data characteristics. Based on the fitted values of range, nugget, and sill, the variogram model was selected from a series of models including Spherical, Exponential, Gaussian, Hole effect, and Linear models. For OK and its combined methods, the Gaussian and J-Bessel models were selected as they better fitted the data and the residuals of the relevant methods than other variogram models in terms of range, nugget, and sill (Fig. 3 and Table 5). We chose the best kriging sample from 5 to 30 with five-step intervals. IDW was estimated with powers of 1, 2, 3, and 4.

Table 5 Semi-variagram models.

Full size table

Analyses of the spatial correlation of residuals reflected good performance after removing the local mean within the different secondary variables (Fig. 3). All of the semi-variograms of the residuals tended to show a shorter range and a smaller sill, which indicated that the drift had indeed been removed²⁷. All of the N/S (except OK) were <0.3, indicating that the mean sample data has strong spatial correlation³⁹, after trend removal, the spatial correlation was stronger (Table 5). This finding suggests that the use of OK and its combined methods is appropriate for this study region.

Secondary variable selection

Analysis of variance (ANOVA) was performed to analyze the significance of the secondary variables in soil potassium content (Table 4). Take land use types for example: To compare the difference of soil potassium content among land use types, the soil potassium content data were grouped into seven classes based on main land use type. There were 11, 78, 15, 36, three, one, and four samples from rainfed cropland, natural grazing land, tame grassland, other land, scrubland, other grassland, and sandy land, respectively. The variances of each soil potassium content between and within land use types were determined using ANOVA with the software package SPSS 21.0 for Windows. It was established that land use, soil, grassland, and geology types were the four strongest variables correlated with soil potassium content (all significant at the 0.01 level). Based on intuition and other references^22,40, it was considered likely that slope would have some influence on the transfer of soil potassium content from the high-slope regions. Therefore, slope was also considered as an important secondary variable in this study. However, slope was eventually excluded because of its lack of correlation with soil potassium content (The slope varied only slightly across the study area, most slopes were between 0° and 8°).With the exception of the geology type, they were used as secondary variables in the AW-SP and OK-Geo methods. Geology was dropped because its performance when combined with OK was worse than OK alone, despite the strong correlation found in this study between soil potassium content and geology type. The types of land use, soil, grassland, and geology were used in the OK-LU, OK-Soil, OK-Grassland, and OK-Geology methods, respectively. The OK and IDW methods do not need secondary variables.

Assessment of the performance

Independent verification was used for the validation of the interpolators in this study. The procedure involves randomly splitting the data into the interpolation and validation subsets, estimating the value using interpolation subset and comparing the interpolated value at every validation point with its measured value. A total of 120 training points were randomly created as interpolation data sets, and the remaining 28 samples were used as validation data sets.

The performance of these interpolation methods was assessed by identifying the error in the predictions. We used the three most common indices, i.e., the mean absolute error (MAE), the root mean square error (RMSE) and the mean error (ME) as measures of the interpolation quality. The formulations of the MAE, RMSE and ME are as below equation (2), (3) and (4).

where z*(xi) is the measured value, z(xi) is the predicted value, and n is the number of validation points. The values of the two indices should be close to zero if the method is completely accurate. In comparison, RMSE is sensitive to the size of outliers and it is used as an indicator of the magnitude of extreme errors. Lower values of RMSE indicate greater central tendency and generally smaller extreme errors⁴¹.

Data and Study Area

Study area

The study area (36°27′51″–37°30′43″N, 99°55′29″–101°05′03″E) is located in the southeast region of the Qinghai Lake Basin. The region covers an area of 7425.61 km² of which 4473.96 km² is water and the land elevation ranges from 3043 to 4516 m. According to the 1:1,000,000 soil maps of the National Soil Census Office, there are eight groups of soil type (Fig. 4b), and according to the 1:500,000 geological map of Qinghai Province from the Qinghai Provincial Bureau of Geology and Mineral Resources, the principal geological type include alluvial terraces, rolling alpine, and valley plain, etc. (Fig. 4c). Land use type are rainfed cropland, natural grazing land, tame grassland, other land, scrubland, other grassland, sandy land and water body, etc. (Fig. 4a), and the grassland can be classified into twenty-three groups (Fig. 4d). Slope type is not considered in this study because of their poor correlation with the soil potassium content (Table 4).

Datasets

We collected a total of 148 topsoil samples (0–30 cm) from the study area in September 2013. We also recorded the soil sample locations, land use type, soil type, grassland type, geology type, and elevation. We analyzed the landscape in the study area using the spatial stratified sampling method⁴². Each position was sampled three times and the average was recorded as the sample values. Each sample was air-dried and passed through a 2-mm sieve to determine the soil potassium content.

Many environmental variables can be used as secondary variables to improve the performance of spatial interpolation methods as discussed by Li and Heap³ and Shi et al.². Following a preliminary analysis, the land use, soil, grassland, and geology types were considered important secondary information in this study. The land use, soil, grassland, and geology types were previously used to improve the performance of the spatial interpolators of soil properties²⁵. Therefore, the inclusion of such environmental variables was expected to improve the predictions. ANOVA analysis (Table 4) revealed that variances of the tested soil potassium content among different secondary variables (except for slope) might play an important role in their spatial prediction in the study area. All datasets of environmental variables were generated in ArcGIS10.1 and resampled to 30-m resolution wherever necessary. However, the small sample size and uneven spatial distribution of the soil samples (Fig. 4) indicate that sub-setting by feature types, leading to some features without soil samples, cannot provide an adequate soil sample size for modeling; as such, we used the secondary variables to improve the interpolation accuracy.

Additional Information

How to cite this article: Liu, W. et al. An Adaptive Weighting Algorithm for Interpolating the Soil Potassium Content. Sci. Rep. 6, 23889; doi: 10.1038/srep23889 (2016).

References

Bishop, T. & McBratney, A. A comparison of prediction methods for the creation of field-extent soil property maps. Geoderma. 103, 149–160 (2001).
Article ADS Google Scholar
Shi, W., Liu, J., Du, Z., Stein, A. & Yue, T. Surface modelling of soil properties based on land use information. Geoderma. 162, 347–357 (2011).
Article CAS ADS Google Scholar
Li, J. & Heap, A. D. In How to reference books. Vol. 137 (eds Li, J. et al.) Ch. 2, 4–5 (Geoscience Australia, 2008).
Google Scholar
Li, J. & Heap, A. D. A review of comparative studies of spatial interpolation methods in environmental sciences: performance and impact factors. Ecol. Inform. 6, 228–241 (2011).
Article Google Scholar
Hengl, T., Heuvelink, G. & Rossiter, D. G. About regression-kriging: from equations to case studies. Comput . Geosci-uk. 33, 1301–1315 (2007).
Article Google Scholar
Hosseini, E., Gallichand, J. & Marcotte, D. Theoretical and experimental performance of spatial interpolation methods for soil salinity analysis. T. Asae. 37, 1799–1807 (1994).
Article Google Scholar
Kravchenko, A. & Bullock, D. G. A comparative study of interpolation methods for mapping soil properties. Agron. J. 91, 393–400 (1999).
Article Google Scholar
Laslett, G., McBratney, A., Pahl, P. J. & Hutchinson, M. Comparison of several spatial prediction methods for soil pH. Eur. J. Soil. Sci. 38, 325–341 (1987).
Article CAS Google Scholar
Leenaers, H., Okx, J. & Burrough, P. Comparison of spatial prediction methods for mapping floodplain soil pollution. Catena. 17, 535–550 (1990).
Article Google Scholar
Marmion, M., Parviainen, M., Luoto, M., Heikkinen, R. K. & Thuiller, W. Evaluation of consensus methods in predictive species distribution modelling. Divers. Distrib. 15, 59–69 (2009).
Article Google Scholar
Panagopoulos, T., Jesus, J., Antunes, M. & Beltrao, J. Analysis of spatial interpolation for optimising management of a salinized field cultivated with lettuce. Eur. J. Agron. 24, 1–10 (2006).
Article Google Scholar
Collins, F. C. In How to reference books. (ed. Collins, F. C. ) 190–199 (National Center, 1996).
Gotway, C. A., Ferguson, R. B., Hergert, G. W. & Peterson, T. A. Comparison of kriging and inverse-distance methods for mapping soil parameters. Soil. Sci. Am. J. 60, 1237–1247 (1996).
Article CAS Google Scholar
Weber, D. & Englund, E. Evaluation and comparison of spatial interpolators. Math. Geol. 24, 381–391 (1992).
Article Google Scholar
Wollenhaupt, N., Wolkowski, R. & Clayton, M. Mapping soil test phosphorus and potassium for variable-rate fertilizer application. J. Prod. Agri. 7, 441–448 (1994).
Article Google Scholar
Cutler, D. R. et al. Random forests for classification in ecology. Ecology 88, 2783–2792 (2007).
Article PubMed Google Scholar
Diaz-Uriarte, R. & De Andres, S. A. Gene selection and classification of microarray data using random forest. BMC. bioinformatics. 7, 1–13 (2006).
Article Google Scholar
Drake, J. M., Randin, C. & Guisan, A. Modelling ecological niches with support vector machines. J. Appl. Ecol. 43, 424–432 (2006).
Article Google Scholar
Gilardi, N. Machine learning for spatial data analysis. A thesis for PhD, University of Lausanne and Dalle Molle Institute of Perceptual Artificial Intelligence. 33–42 (2002).
Rigol, J. P., Jarvis, C. H. & Stuart, N. Artificial neural networks as a tool for spatial interpolation. Int. J. Geogr. Inf. Sci. 15, 323–343 (2001).
Article Google Scholar
Liu, W., Du, P. J. & Wang, D. C. Ensemble Learning for Spatial Interpolation of Soil Potassium Content Based on Environmental Information. Plos One. 10, e0124383, doi: doi:10.1371/ journal.pone.0124383 (2015).
Article PubMed PubMed Central Google Scholar
Li, J., Heap, A. D., Potter, A. & Daniell, J. J. Application of machine learning methods to spatial interpolation of environmental variables. Environ. Modell. Soft. 26, 1647–1659 (2011).
Article Google Scholar
Hansen, L. K. & Salamon, P. Neural network ensembles. IEEE. T. Pattern. Anal. 12, 993–1001 (1990).
Article Google Scholar
Zhou, Z. H., Wu, J. X. & Tang, W. Ensembling neural networks: many could be better than all. Artif. Intell. 137, 239–263 (2002).
Article MathSciNet Google Scholar
Shi, W., Liu, J., Du, Z. & Yue, T. Development of a surface modeling method for mapping soil properties. J. Geogr. Sci. 22, 752–760 (2012).
Article Google Scholar
Hartemink, A. E. & McBratney, A. A soil science renaissance. Geoderma. 148, 123–129 (2008).
Article ADS Google Scholar
Hengl, T., Heuvelink, G. & Stein, A. A generic framework for spatial prediction of soil variables based on regression-kriging. Geoderma. 120, 75–93 (2004).
Article ADS Google Scholar
Kravchenko, A. & Robertson, G. Can topographical and yield data substantially improve total soil carbon mapping by regression kriging? Agron. J. 99, 12–17 (2007).
Article Google Scholar
McBratney, A. B., Odeh, I. O., Bishop, T. F., Dunbar, M. S. & Shatar, T. M. An overview of pedometric techniques for use in soil survey. Geoderma. 97, 293–327 (2000).
Article ADS Google Scholar
Minasny, B. & McBratney, A. B. Spatial prediction of soil properties using EBLUP with the Matérn covariance function. Geoderma. 140, 324–336 (2007).
Article ADS Google Scholar
Odeh, I. O., McBratney, A. & Chittleborough, D. Further results on prediction of soil properties from terrain attributes: heterotopic cokriging and regression-kriging. Geoderma. 67, 215–226 (1995).
Article ADS Google Scholar
Sumfleth, K. & Duttmann, R. Prediction of soil property distribution in paddy soil landscapes using terrain data and satellite information as indicators. Ecol. Indic. 8, 485–501 (2008).
Article Google Scholar
Borůvka, L., Mládková, L., Penížek, V., Drábek, O. & Vašát, R. Forest soil acidification assessment using principal component analysis and geostatistics. Geoderma. 140, 374–382 (2007).
Article ADS Google Scholar
Basaran, M., Erpul, G., Tercan, A. & Canga, M. The effects of land use changes on some soil properties in Indaği Mountain Pass-Çankiri, Turkey. Environ. Monit. Assess. 136, 101–119 (2008).
Article CAS PubMed Google Scholar
Hu, K., Li, H., Li, B. & Huang, Y. Spatial and temporal patterns of soil organic matter in the urban–rural transition zone of Beijing. Geoderma. 141, 302–310 (2007).
Article ADS Google Scholar
Goovaerts, P. Geostatistics in soil science: state-of-the-art and perspectives. Geoderma. 89, 1–45 (1999).
Article ADS Google Scholar
Martínez-Cob, A. Multivariate geostatistical analysis of evapotranspiration and precipitation in mountainous terrain. J. Hydrol. 174, 19–35 (1996).
Article ADS Google Scholar
Asli, M. & Marcotte, D. Comparison of approaches to spatial estimation in a bivariate context. Math. geol. 27, 641–658 (1995).
Article CAS Google Scholar
Cambardella, C. A. & Karlen, D. L. Spatial Analysis of Soil Fertility Parameters. Precis. Agric. 1, 5–14 (1999).
Article Google Scholar
Kuriakose, S. L., Devkota, S., Rossiter, D. & Jetten, V. Prediction of soil depth using environmental variables in an anthropogenic landscape, a case study in the Western Ghats of Kerala, India. Catena. 79, 27–38 (2009).
Article Google Scholar
Zhao, N. & Yue, T. A modification of HASM for interpolating precipitation in China. Theor. Appl. Climatol. 116, 273–285 (2014).
Article ADS Google Scholar
Zhang, H., Lu, L., Liu, Y. & Liu, W. Spatial Sampling Strategies for the Effect of Interpolation Accuracy. ISPRS Int. J. Geo-Inf. 4, 2742–2768 (2015).
Article Google Scholar

Download references

Acknowledgements

This study is supported by the Natural Science Foundation of the Jiangsu Higher Education Institutions of China (Grant No. 14KJD170001), by the Natural Science Foundation of the Jiangsu Provincial Bureau of Surveying and mapping geographic information of China (Grant No. JSCHKY201405) and by the National Key Technology Research and Development Program of the Ministry of Science and Technology of China (Grant No. 2012BAH31B00).

Author information

Authors and Affiliations

Jiangsu Provincial Key Laboratory of Geographic Information Science and Technology, Nanjing University, Nanjing, People’s Republic of China
Wei Liu & Peijun Du
School of Geodesy and Geometrics, Jiangsu Normal University, Xuzhou, People’s Republic of China
Wei Liu, Zhuowen Zhao & Lianpeng Zhang

Authors

Wei Liu
View author publications
You can also search for this author in PubMed Google Scholar
Peijun Du
View author publications
You can also search for this author in PubMed Google Scholar
Zhuowen Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Lianpeng Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceived and designed the experiments: W.L. and P.D. Performed the experiments: Z.Z. Analyzed the data: L.Z. Wrote the paper: W.L.

Corresponding authors

Correspondence to Wei Liu or Peijun Du.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information (DOC 42 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Liu, W., Du, P., Zhao, Z. et al. An Adaptive Weighting Algorithm for Interpolating the Soil Potassium Content. Sci Rep 6, 23889 (2016). https://doi.org/10.1038/srep23889

Download citation

Received: 29 October 2015
Accepted: 15 March 2016
Published: 07 April 2016
DOI: https://doi.org/10.1038/srep23889

This article is cited by

An integrated approach for the assessment and monitoring of land degradation and desertification in semi-arid regions using physico-chemical and geospatial modeling techniques
- Pradeep Kumar Badapalli
- Raghu Babu Kottala
- Veeraswamy Golla
Environmental Science and Pollution Research (2022)
Advances of Four Machine Learning Methods for Spatial Data Handling: a Review
- Peijun Du
- Xuyu Bai
- Wei Liu
Journal of Geovisualization and Spatial Analysis (2020)
Prediction of soil cadmium distribution across a typical area of Chengdu Plain, China
- Qiquan Li
- Changquan Wang
- Yongdong Wang
Scientific Reports (2017)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Global Soil Hydraulic Properties dataset based on legacy site observations and robust parameterization

The SMOS-Derived Soil Water EXtent and equivalent layer thickness facilitate determination of soil water resources

African soil properties and nutrients mapped at 30 m spatial resolution using two-scale ensemble machine learning

Introduction

Results

Comparison of interpolation performance

Effects of the exclusion of slope and geology

Comparison of the interpolated maps

Discussion

The performance of ensemble learning for spatial interpolation

The effectiveness of secondary variables for reducing predictive error

Limitations

Methods

AW-SP

Parameter specification

Secondary variable selection

Assessment of the performance

Data and Study Area

Study area

Datasets

Additional Information

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Supplementary information

Supplementary Information (DOC 42 kb)

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

An integrated approach for the assessment and monitoring of land degradation and desertification in semi-arid regions using physico-chemical and geospatial modeling techniques

Advances of Four Machine Learning Methods for Spatial Data Handling: a Review

Prediction of soil cadmium distribution across a typical area of Chengdu Plain, China

Comments

Search

Quick links