Geoinformation-based landslide susceptibility mapping in subtropical area

Zhou, Xiaoting; Wu, Weicheng; Qin, Yaozu; Fu, Xiao

doi:10.1038/s41598-021-03743-5

Download PDF

Article
Open access
Published: 21 December 2021

Geoinformation-based landslide susceptibility mapping in subtropical area

Xiaoting Zhou¹,
Weicheng Wu¹,
Yaozu Qin¹ &
…
Xiao Fu¹

Scientific Reports volume 11, Article number: 24325 (2021) Cite this article

3551 Accesses
14 Citations
Metrics details

Subjects

Abstract

Mapping susceptibility of landslide disaster is essential in subtropical area, where abundant rainfall may trigger landslide and mudflow, causing damages to human society. The purpose of this paper is to propose an integrated methodology to achieve such a mapping work with improved prediction results using hybrid modeling taking Chongren, Jiangxi as an example. The methodology is composed of the optimal discretization of the continuous geo-environmental factors based on entropy, weight of evidence (WoE) calculation and application of the known machine learning (ML) models, e.g., Random Forest (RF), Support Vector Machine (SVM) and Logistic Regression (LR). The results show the effectiveness of the proposed hybrid modeling for landslide hazard mapping in which the prediction accuracy vs the validation set reach 82.35–91.02% with an AUC [area under the receiver operating characteristic (ROC) curve] of 0.912–0.970. The RF algorithm performs best among the observed three ML algorithms and WoE-based RF modeling will be recommended for the similar landslide risk prediction elsewhere. We believe that our research can provide an operational reference for predicting the landslide hazard in the subtropical area and serve for disaster reduction and prevention action of the local governments.

Estimate earth fissure hazard based on machine learning in the Qa’ Jahran Basin, Yemen

Article Open access 19 December 2022

Assessing and mapping multi-hazard risk susceptibility using a machine learning technique

Article Open access 21 February 2020

A multi-hazard map-based flooding, gully erosion, forest fires, and earthquakes in Iran

Article Open access 21 July 2021

Introduction

Landslide is a common geological disaster leading to destruction and damages to human society in subtropical areas. With the socioeconomic development and the continuous expansion of human activities into the natural environment, landslide occurs more and more frequently and constitutes the main disaster threatening the safety of life and restricts the economic development in the hilly and mountainous areas^1,2,3,4. Accurate and reliable mapping of landslide risk is a key step for local decision-makers and authorities to plan reasonable land use and implement disaster reduction and prevention measures to reduce the massive damage^5,6,7,8,9.

Actually, a number of scientists have been exploring reliable approaches for landslide hazard mapping^10,11. With the advent of geoinformation technology including remote sensing (RS), Geographic Information System (GIS), Global Positioning System (GPS) or Beidou System (BDS) and powerful computer processing facility, acquisition and processing of geo-environmental factors with high resolution have been greatly facilitated^8,12. The prediction of landslide hazard has been also upgraded from knowledge-driven qualitative analysis to data-driven quantitative modeling^13,14,15. The knowledge-driven model is to sort out and weight the limited landslide influencing factors based on a priori knowledge to conduct a landslide susceptibility mapping^16,17, while the data-driven modeling is to achieve the same purpose but able to avoid the subjective uncertainty of experts and has higher accuracy and reliability^17,18,19,20.

Statistical analysis and machine learning (ML) modeling are two major data-driven approaches. The calculation process of the statistical models such as frequency ratio (FR), certainty coefficient (CF), information value (IV) and weight of evidence (WoE) is simple; and qualitative or categorical factors can be converted into quantitative weights by these approaches, and thence, they are widely employed for landslide risk assessment^15,21,22,23. However, the statistical models are sensitive to the nonlinear phenomena which require specific algorithms to sort them out^23,24.

Since the appearance of artificial intelligence, different ML algorithms including deep learning have been applied in the field of landslide risk mapping^{11,25,26,27,28}. Based on the target definition, or rather, collection of samples for training, ML approaches can automatically analyze and extract rules from the input data to make predictions¹⁴. Meanwhile, it is highly efficient in calculating high-dimension data and can fit the nonlinear relationships between target and factors^8,29,30,31. Nevertheless, the prediction accuracy of the most studies, even including those harnessing the hotspotted deep learning techniques^32,33,34,35, comes between 75 and 85%, except for those of Huangfu et al.³⁶, Ou et al.²⁶, Zhang et al.²⁷ and Zhou et al.²⁸, who have achieved landslide risk prediction with an accuracy of 86–94.54%. This is not ideal for government to target effectively and accurately the high risk zones for implementing disaster reduction and prevention measures in the subtropical areas. Hence, it is necessary to effectuate some improvement in certain technical aspect of the ML approaches.

It has been decades since hybrid models were proposed for landslide risk assessment. Hybrid models are in fact constructed by integrating two or more models in aspect of sample selection^28,37, feature selection^21,38, information extraction and finally landslide hazard prediction with reasonable accuracy^{10,22,25,39,40,41}. Hence, hybrid modeling has gained recently a momentum in improving the accuracy and reliability of landslide risk mapping^{26,36,40,42,43}. However, there are still uncertainty in processing both categorical and continuous factors which may influence directly the prediction accuracy.

Based on the above understanding, the main objective of this study is to improve the landslide risk modeling and prediction using hybrid models by coupling WoE with ML algorithms such as Logistic Regression (LR), Support Vector Machine (SVM) and Random Forest (RF) taking Chongren, Jiangxi, China, a typical county in the subtropical area, as an example. A specific objective is to test the effectiveness of the discretization approach based on entropy to see whether it can bring us the expected improvement while discretizing the continuous factors.

Data and methodology

The methodological procedures involved in the research are depicted as follows: (1) data preparation of landslide samples and geo-environmental factors; (2) entropy-based optimal discretization of the continuous factors; (3) WoE-based processing of both continuous and categorical geo-environmental factors and establishment of the hybrid models; (4) modeling and mapping of landslide susceptibility; (5) accuracy assessment and validation of the proposed models (Fig. 1).

Study area

Chongren is a county situated in the central part of Jiangxi, within the extent of longitude from 115° 49′ 16″ E to 116° 16′ 55″ E and latitude from 27° 24′ 29″ N to 27° 57′ 29″ N (Fig. 2), encompassing an area of 1520 km². The general landform is an incomplete hilly basin surrounded by mountains on three sides and opening toward the northeast. The annual average temperature from 1981 to 2010 is 17.6 °C, and the annual average precipitation from 1959 to 2017 is 1783.8 mm driven by monsoon in the subtropical climate zone. There are more than 140 small rivers or streams in the study area with an accumulated running course of 910 km. All these rivers or streams constitute a part of the Fuhe River watershed as tributaries and subtributaries. Geologically, the exposed strata are from the Upper Proterozoic, e.g., Sinian (Nanhua) to the Upper Palaeozoic, e.g., Devonian, Carboniferous, and to the Mesozoic, i.e., Triassic, Jurassic, and Cretaceous and at last the Quaternary. Since the Proterozoic era, the study area had experienced sedimentation, magmatism, tectonism and metamorphism with intense and complex development and transformation, forming a complex structural pattern composed of tectonic entities such as ductile faults, superimposed folds, brittle faults and depression basins.

Regarding the geological disasters, small-scale shallow landslides are dominant in the study area. After slope cutting for infrastructure construction, the natural loose deposits (i.e., soil) or cracked rock masses (mainly phyllitic slate and rocks with downslope bedding or fracture) lose support and balance, forming a new free dangling surface. In case of heavy rainfall, the slope slips downward due to heavy load and instability. Such landslides generally have no signs, and the time from creeping to occurrence of an obvious slip is short, which, therefore, often causes major geological disasters leading to house collapse and casualties. Moreover, in the site of such landslides, a new scarp (or back wall) is formed, inducing the generation of new landslides at the trailing edge of the slope. This process is the same as the development of headward erosion in a slope valley, producing a chain of landslides.

Field investigation revealed that heavy rains triggered several landslides near the town Xiangshan on July 7, 2019, severely blocking the traffic with more than 30,000 m³ of landslide bodies; and on August 23, 2017, a landslide with a total volume of about 10,000 m³ occurred in the village Pingshan due to a rainstorm, causing power outage, interruption of telecommunication and severe road congestion.

Field observation data

The prediction of landslide disaster based on data-driven method is to calculate the probability of landslide occurrence in the study area by fitting the relationship between the historical landslides and the geo-environmental factors⁴⁴. A detailed field survey of the historical landslides in the past decade was conducted in Chongren during the campaign of 1/50,000 Geological Disaster Survey by the 264 Geological Brigade of Jiangxi Nuclear Industry in 2017 and 588 landslides that took place in the period 2008–2017 (Fig. 3) were obtained as points. In reference to Google Earth (©Google) images, these landslide points were verified and vectorized into polygons. Meanwhile, the same number of stable points were stochastically selected in the stable areas, e.g., where the slope is less than 3°. A value of 1 was assigned to landslides and 0 to non-landslide points. As proposed by Zhang et al.²⁷, Huangfu et al.³⁶, Ou et al.²⁶, and Zhou et al.²⁸, 70% of the landslides and non-landslide samples were randomly picked out to constitute a training set (TS) to model landslide susceptibility, and the remained landslides and non-landslide samples (30%) as a validation set (VS) to evaluate the accuracy of modeling.

Geo-environmental factors

Preparation

The occurrence of landslides is a consequence of the long-term joint action of the endogenous factors, i.e., geology, landform, vegetation and soil, etc., and the short-term predisposing factors, i.e., rainfall, earthquake and anthopogenic activities^18,27. According to previous research on the landslide-causative factors^27,28,36 and landslide field investigation in Chongren, geological and geomorphological data, hydrological data, land cover and transport system data were used to establish geoinformation datasets for landslide hazard analysis.

Geological factor layers such as lithology, geological boundary and faults were generated by vectorization, buffering, and rasterization from the 1/50,000 Geological Map (Fig. 4a,b). The soil data including soil types and texture were provided by the Bureau of Jiangxi Coal Geology.

Slope and aspect factor layers were extracted from the digital elevation model (DEM), ASTGTMV003 (30 m), which were obtained from NASA (www.earthdata.nasa.gov) (Fig. 4c,d). The topographic wetness index (TWI) was also calculated using DEM data (Fig. 5a), using Eq. (1)²⁰:

$$ {\text{TWI}} = {\text{ln}}{\raise0.7ex\hbox{${A_{s} }$} \!\mathord{\left/ {\vphantom {{A_{s} } {{\text{tan}}\beta }}}\right.\kern-\nulldelimiterspace} \!\lower0.7ex\hbox{${{\text{tan}}\beta }$}} $$

(1)

where A_S is the upslope area of contribution per unit length of contour (m²/m), and β is the slope gradient.

The normalized difference vegetation index (NDVI) is a good representative of vegetation dynamics and can hence be considered as a controlling factor of landslide. For this reason, the multiyear autumn average NDVI was adopted to reduce the influence of uncertainty factors related to cloud cover and vegetation phenological change. Obtained from the USGS data server, Landsat 5 TM (30 m) and Landsat 8 OLI (30 m) images of the period 2007–2017 were used for this purpose. These Landsat images were acquired in late autumn, i.e., late October and early November, when crops are mostly harvested and only forests and woodlands are still green. After atmospheric correction using the COST model^45,46,47, these Landsat images were employed for deriving the mean autumn NDVI (Fig. 5b), and Landsat 8 OLI images dated May 2017 and Sept 2019 were used for land cover mapping (Fig. 5c) using the approach developed by Wu et al.²⁹.

Daily precipitation data from 2008 to 2017 were obtained from 14 meteorological stations in Chongren. Our previous studies revealed that the precipitation from May to July has a higher impact on the landslide occurrence than the combination of other months^27,28. Thus, the May–July accumulated mean rainfall was generated by interpolation approach of the Inverse Distance Weighting (IDW) (Fig. 5d).

Linear feature factors such as roads and rivers were vectorized from Google Earth (©Google) (Fig. 6a,b) and buffered into belts with intervals at 30, 60, 90, 120 and 150 m, respectively.

Optimal discretization of the continuous factors

The supervised discretization approach based on entropy was used to divide the continuous variables into intervals to realize optimal discretization. Using the entropy value to represent the purity of the dataset after partition is the basic idea of the approach. The smaller the entropy, the greater the data purity and the higher the availability of the discrete data obtained. The formula of entropy is presented as follows:

$$ E = \sum { - P_{i} \log_{2} P_{i} } $$

(2)

where P_i represents the probability of class i of sample appearing in the data interval. The results of division for continuous factors are shown in Table 1.

Table 1 The weight contrasts (C) of the geo-environmental factors.

Full size table

WoE-based processing of geo-environmental factors

Originally developed for mineral potential mapping based on Bayesian probability by Bonham-Carter et al.⁴⁸, WoE has been introduced into the prediction of landslide hazard in recent years and achieved a good result¹⁵. The weight values of the evidential variables (i.e., geo-environmental factors) are statistically calculated by the spatial relationship of landslide events with geo-environmental factors^7,49.

The positive weight (W⁺) and negative weight (W⁻) are provided by the following equations:

$$ W^{ + } = \ln \frac{{P(B{|}D)}}{{P(B|\overline{D})}} $$

(3)

$$ W^{ - } = \ln \frac{{P(\overline{B}|D)}}{{P(\overline{B}|\overline{D})}} $$

(4)

where W⁺ and W⁻ are the weighted values of the occurrence and non-occurrence of the observed geo-environmental factor, respectively. B and $\overline{B}$ is occurrence and non-occurrence of the geo-environmental factor, respectively; D and $\overline{D}$ are the occurrence and non-occurrence of landslide events, respectively; P is the probability^7,49.

The weight contrast (C) is a global measurement of the spatial interconnection between the landslide points and the geo-environmental factors, incorporating the effects of the W⁺ and W⁻. Calculation of C is shown as follows⁴⁸:

$$ C = W^{ + } - W^{ - } $$

(5)

where if C is > 0, it indicates that the occurrence of landslide is positively correlated with the geo-environmental factor; and if C is < 0, it implies that the occurrence of landslide is negatively correlated with the geo-environmental factor. The weight of evidence values of all the geo-environmental factors are shown in Table 1.

Each interval of the divided continuous factors and each type of feature within the categorical factor were considered as a “subset”. The positive weight (W⁺) and negative weight (W⁻) of different intervals or subsets for the geo-environmental factors were calculated using Eqs. (3) and (4). Lithology, soil type, soil texture, distance to faults, distance to geological boundary, distance to rivers, distance to roads, elevation, slope, aspect, TWI, autumn mean NDVI, May–July accumulated mean rainfall and land use were transformed into raster layers with 30 m resolution as input variables (e.g., C values) for WoE-based hybrid modeling.

The calculation of WoE and C are implemented within Arc-WofE, an extension to ArcView 3.3 developed jointly by the USGS and the Geological Survey of Canada⁵⁰.

Machine learning modeling

Based on the WoE calculation, the following machine learning algorithms were applied for landslide susceptibility modeling, or rather, hybrid modeling. LR model was established within SPSS 19.0 software, meanwhile, SVM and RF modeling was implemented within EnMap-Box 2.11, a software package developed using Interactive Data Language (IDL)⁵¹.

LR modeling

(1)
Collinearity analysis

Prior to the LR modeling, it is necessary to understand the collinearity among the independent variables, that is to say, to ascertain whether there exists linear correlation among the independent geo-environmental factors. This collinearity may lead to an instability of the LR model and affect the contribution of variables to the model⁵². Common indicators to evaluate the collinearity of geo-environmental factors are the variance inflation factor (VIF) and tolerances (TOL)⁵³. The statistical model and LR require that there be no collinearity among the factors, that is, TOL > 0.1 and VIF < 10^27,54.
(2)
LR modeling

LR is an algorithm that learns a model for binary classification^46,55 whose kernel function is sigmoid (Eq. 6).

$$ p(x) = \frac{1}{{1 + e^{ - x} }} $$

(6)

The purpose of the conventional regression algorithms is to fit a polynomial function (Eq. 7) that minimizes the error between the prediction and the reality.

$$ f(x) = c_{0} + c_{1} x_{1} + \ldots + c_{n} x_{n} $$

(7)

where x_i (i = 1, 2, 3, … n) are independent features of the samples; c_i (i = 1, 2, 3, … n) are the coefficients of the features, and c₀ is a constant. f(x) is transformed into a sigmoid function so that it has a good logistic judgment property and can directly express the probability in which the sample with the given features is classified into a certain class. p(x) = 1 is the probability of samples being assigned to category 1, then p(x)/(1 − p(x)) is defined as odds ratio (OR) to introduce the natural logarithm (Eq. 8).

$$ f(x) = {\text{ln}}(\frac{p(x)}{{1 - p(x)}}) $$

(8)

p(x) is expressed as following function (9):

$$ p(x) = \frac{1}{{1 + e^{{ - (c_{0} + c_{1} x_{1} + \ldots + c_{n} x_{n}^{{}} )}} }} $$

(9)

The training samples and their corresponding attributes of environmental factors were inputted into a statistic package SPSS 19.0 to calculate the coefficients of environmental factors. Then, in the GIS environment, the probability of landslide occurrence in the study area was calculated through formula (9).

SVM modeling

As a classical classification and regression algorithm, SVM has clear advantages in dealing with high-dimensional data with limited samples. SVM attempts to find or construct a set of hyperplanes through kernel functions to separate clusters that are usually not linearly separable in low-dimensional feature space, minimizing the empirical error and uncertainty to improve the generalization performance^56,57. The kernel functions include Linear, Polynomial, Sigmoid and Radial Basis Functions (RBF), among which the RBF, similar to Gaussian distribution and thus termed also Gaussian function (Eq. 10), performed best^29,30 and has been widely used in classification and regression as it has fewer parameters and stronger flexibility³⁴. The RBF kernel was hence used to establish the SVM model in this study.

$$ k\left( {x_{i} ,x_{j} } \right) = exp\left( { - g\left\| {x_{i} - x_{j} } \right\|^{2} } \right) $$

(10)

where x_i and x_j are the input vectors, and g is the width parameter of the Gaussian kernel function k.

RF modeling

RF is a decision-trees-based classification and regression algorithm that outputs the final outcome by voting all the results of these trees⁵⁸. The classification decision-maker used in the RF algorithm is the Classification and Regression Tree (CART)⁵⁹. The training samples of the decision-trees are obtained by randomly replaceable sampling in the original TS. The remaining samples, called the out-of-bag (OOB) data, are used for establishing an unbiased estimate of error during generalization and estimating the importance of each factor. The metric of attribute of CART in branch processing is Gini Coefficient (Eq. 11).

$$ Gini = 1{ - }\sum\limits_{i = 1}^{2} {p_{i}^{2} } $$

(11)

where p_i represents the probability of which the observed sample falls in category i, so the probability of this sample being misclassified is (1 − p_i).

In order to distinguish each predictor in the ensemble classifier, a specific number of variables are stochastically selected for generating the necessary nodes in the decision-tree. This construction method enables the RF to further improve the prediction performance through the increase of the difference among the individual classification trees and to avoid over-fitting. The number of variables at each node can be the square root of all features or logarithm (log) of all features or a user-defined value. In this study, the square root of all features, 4, was selected.

Model performance assessment

The confusion matrix is often used for evaluation of the performance of the ML models. It mainly includes the following basic indicators: True Positive (TP) is the number of landslide samples correctly predicted by the model; False Negative (FN) is the number of landslide samples wrongly predicted as stable points by the model; False Positive (FP) is the number of stable samples mistakenly classified as landslide samples; True Negative (TN) is the number of stable samples correctly predicted by the model. The performance indicators of landslide hazard model, e.g., Precision, Recall, F-measure, Kappa Coefficient (KC), Overall Accuracy (OA) and AUC [area under the Receiver Operating Characteristic (ROC) curve], were calculated on the basis of confusion matrix^8,34.

According to previous studies, the smaller the very high susceptible zone and the more landslide samples predicted, the higher the accuracy of the landslide risk map⁶⁰. To assess the accuracy of the latter, the FR was also calculated, which is the ratio of the percentage of the cell number of landslides at each susceptibility level to the percentage of the cell number of each hazard level⁶¹. For a reliable landslide prediction model, the very high risk level shall possess the highest FR.

Results

Collinearity of the geo-environmental factors

As demonstrated in Table 2, the minimum TOL and maximum VIF values of the variables processed by WoE method were 0.878 and 1.139, respectively. The collinearity of WoE-based variables was significantly lower than that of the original variables, in which the minimum TOL and the maximum VIF are 0.215 and 4.642, respectively. Processing based on WoE can effectively reduce the collinearity among the factors. The collinearity among the geo-environmental factors selected for this research is low, and thus, they can be used for susceptibility modeling.

Table 2 Regression coefficients (β) and collinearity of the variables.

Full size table

Hybrid models

WoE-based LR models

Regression coefficient (β) and R² of the WoE-based LR model is shown in Table 2. The single LR model was also established for comparison. The fitting degree of the WoE-based LR Model (R² = 0.886) was better than that of the single model (R² = 0.707). The WoE-based LR and single LR model were expressed using Eqs. (12) and (13). The probabilities of the landslide are calculated as follows:

$$ p(x) = \frac{1}{{1 + e^{{ - ({ - }8.685{ - }0.028x_{1} + \ldots { - }0.136x_{15}^{{}} )}} }} $$

(12)

$$ p(x) = \frac{1}{{1 + e^{{ - (1.119 + 0.929x_{1} + \ldots + 1.119x_{n}^{{}} )}} }} $$

(13)

where x₁-lithology, x₂-geological boundary, x₃-fault, x₄-slope, x₅-aspect, x₆-elevation, x₇-land use, x₈-NDVI, x₉-May–July mean rainfall, x₁₀-river, x₁₁-road, x₁₂-sand, x₁₃-clay, x₁₄-soil type and x₁₅-TWI.

According to the modeled probability of each cell, the landslide risk zoning maps from WoE-based LR and the single LR model were created.

WoE-based SVM model

The width parameter g and the regularization parameter c of the optimal Gaussian kernel function were obtained by using the internally validated 2D grid search method, which were 1, 0.1 and 0.1, 100 in the WoE-based SVM and the single SVM model respectively. The c parameter indicates the penalty level for the error item⁸. The c value of the single SVM model was much higher than that of the WoE-based SVM model, implying that the penalty of the single SVM model for misclassification of the samples in the training process was bigger than that of the WoE-based SVM model, implying that the latter has stronger generalization capacity.

WoE-based RF model

The number of decision-trees (NT) has an important effect on the accuracy of RF model. The prediction performance of RF is poor when NT is small, and it becomes better when NT is larger. However, with the increase of NT, the complexity of RF model gradually increases, and the modeling time is also longer. Several experiments show that when NT was increased to 300, the prediction performance of RF was stable²⁸. Based on this, the RF model for predicting landslide hazard was established with the NT of 300.

Landslide susceptibility maps (LSM)

The generated probability of landslide occurrence from the above hybrid models was reclassified into five levels: 0–0.2, 0.2–0.4, 0.4–0.6, 0.6–0.8 and 0.8–1, representing the five levels of landslide susceptibility, i.e., very low, low, moderate, high and very high, and the zoning maps are presented in Fig. 7. It is seen that most of the occurred landslides are distributed along the roads.

As revealed in Table 3, the very high susceptibility areas of the WoE-based LR and single LR, the WoE-based SVM and single SVM, the WoE-based RF and single RF were 88.80 km², 110.78 km², 137.47 km², 110.78 km², 77.87 km², 79.13 km², respectively, accounting for 5.94%, 7.30%, 9.06%, 8.71%, 5.93% and 6.43% of the studied territory, respectively. In all landslide susceptibility maps, FR values range from 0.01 to 14.05, and the very low risk level had also the very low FR and vice versa. With the increase of the susceptibility level, the area of the corresponding level decreases and the percentage of landslides increases, denoting the high prediction accuracy by all the coupled hybrid models. Our analysis also exhibits that the WoE-based RF modeling map grasps the highest FR but with the least surface area at very high risk level, indicating that this hybrid model performs better than others and may allow us to target accurately the zones for implementing landslide risk reduction and prevention measures.

Table 3 Landslide distribution with different susceptibility levels.

Full size table

Comparison of the LSMs

As shown in Table 4, the statistic indicators based on the confusion matrix show that the OA and KC of the coupled hybrid models, i.e., WoE-based LR, WoE-based SVM and WoE-based RF, were 82.35%, 87.86%, 91.20% and 0.6470, 0.7573, 0.8199 respectively, and the OA and KC of the single models of LR, SVM and RF were 76.75%, 81.00%, 89.00% and 0.5350, 0.6210, 0.7800 respectively. It is evident that the coupled hybrid models are able to effectuate a prediction with higher accuracy than the single models, and the WoE-based RF model had the highest OA and KC, and hence performed best. In accordance with the FR calculated by the landslide risk map, the accuracy and reliability of the coupled models with WoE-based variables are improved with regard to the single prediction model.

Table 4 The statistic indicators based on the confusion matrix versus the validation set (VS).

Full size table

The ROC curves and AUC of the coupled hybrid models in this study are shown in Fig. 8. It is seen that AUC of the WoE-based LR, WoE-based SVM and WoE-based RF are 0.912, 0.950 and 0.970 respectively, and that of the single models of LR, SVM and RF are 0.905, 0.917, 0.954, respectively.

Discussion

Advantages of the hybrid modeling

Based on the optimal discretization of the continuous factors, the WoE approach itself is able to provide the probability information of landslide in line with the a priori knowledge of the contribution of each geo-environmental factor to the historical landslides¹⁵. This should be favorable for the successive ML modeling of the landslide susceptibility. As a preprocessing approach, WoE has the following advantages: (1) the response degree of different subsets or intervals of these factors to landslide occurrence is quantitatively evaluated by the evidence weight; (2) the categorical variables are converted into numerical ones without subjective assignment; (3) the interference of outliers to the model is reduced by providing evidence weights to the geo-environmental factors. Hence, the WoE can simplify the ML processes and improve their prediction accuracy.

This research illustrates that WoE-based ML modeling performs better than single ML model and may lead to a reliable prediction, and the RF algorithm performs better than LR and SVM algorithms. The integration and random sampling characteristics make the RF model to have clear advantages over the others in the following aspects: (1) prediction less affected by the disturbance of data, (2) higher accuracy, and (3) more effective to prevent over-fitting thanks to using the Strong Law of Large Numbers for construction of the decision-trees. Some authors have specifically discussed the performance of ML models in predicting landslide hazard and showed that the RF algorithm may derive a higher prediction accuracy than other models, and is hence more suitable for landslide susceptibility mapping^{11,14,18,27,28,62,63}. Our result is consistent with the conclusions of these authors.

Comparison with other researches

As above mentioned, the reasonable processing, e.g., discrete processing of the continuous geo-environmental factors, together with WoE can improve the performance of ML models^10,21,38. In this research, the OA and KC of all the coupled models are better than those of single models, which reflects the usefulness of such preprocessing prior to ML modeling.

The landslide susceptibility of the Chongren area had also been modeled by other authors. The one of Hong et al.⁶⁴ shows that the index of entropy (IOE) model obtains a better accuracy than other binary models with an AUC value of 0.817. Two other studies conducted by Chen et al.^62,65 show that RF can achieve satisfactory results among the ML algorithms with an AUC value of 0.851. Compared with the existing works, even those conducted in other areas with deep learning techniques, the accuracy of this study, with AUC values of 0.912–0.970, is greatly improved. This implies the effectiveness of the WoE-based hybrid ML modeling and entropy-based optimal discretization of the continuous factors. Thus, the methodology proposed in this study is considered effective and extendable to other subtropical areas for landslide hazard mapping.

Conclusions

This paper presents an integrated study on landslide hazard mapping taking Chongren county as an example. Though the single known ML algorithm including deep learning and even the hybrid models have been applied by other researchers, the methodology proposed in this study, composed of an integrated procedure as mentioned above, does make an improved landslide risk prediction possible.

Our study reveals the effectiveness of the hybrid modeling for landslide risk mapping in which the WoE was applied for preprocessing the geo-environmental factors and ML algorithms for modeling. The coupled hybrid models, e.g., WoE-based LR, WoE-based SVM and WoE-based RF, have higher precision and better generalization ability than the single models for landslide hazard prediction. We also note that the decision-tree-based ensemble algorithm has achieved rather satisfactory results in comparison with others and that the WoE-based RF model offers a robust landslide prediction, and will be hence recommended for the similar landslide prediction elsewhere.

As we have noted, road construction is the most important geo-environmental factor provoking landslides and this confirms what we have observed in previous studies^26,27,28,36. This requires our attention to the potential disaster that may be induced while planning future urbanization and road development.

Another innovation of this research is using the optimal discretization approach for numeric factors prior to the application of the WoE approach. After this, the landslide susceptibility prediction based on ML algorithms becomes more reliable. We believe that our research provides an operational methodology for predicting the hazard of landslide and collapse in the subtropical area, and may serve better for local authorities to accurately target the risk zones to implement disaster early warning and prevention measures.

References

Malet, J. P. & Maquaire, O., 2008. Risk assessment methods of landslides, Ramsoil, risk assessment methodologies for soil threats, Sixth Framework Programme, Project Report 2.2.
Bandara, A. et al. A Generalized Ensemble Machine Learning Approach for Landslide Susceptibility Modeling. Data Management, Analytics and Innovation Vol 1073 71–93 (Springer Singapore, 2020). https://doi.org/10.1007/978-981-13-9364-8_6.
Book Google Scholar
Bathrellos, G. D., Skilodimou, H. D., Chousianitis, K., Youssef, A. M. & Pradhan, B. Suitability estimation for urban development using multi-hazard assessment map. Sci. Total Environ. 575, 119–134. https://doi.org/10.1016/j.scitotenv.2016.10.025 (2017).
Article ADS CAS PubMed Google Scholar
Ma, Z., Mei, G. & Piccialli, F. Machine learning for landslides prevention: A survey. Neural Comput. Appl. 33(17), 10881–10907. https://doi.org/10.1007/s00521-020-05529-8 (2021).
Article Google Scholar
Aleotti, P. & Chowdhury, R. Landslide hazard assessment: Summary review and new perspectives. Bull. Eng. Geol. Environ. 58(1), 21–44 (1999).
Article Google Scholar
Bathrellos, G. D., Kalivas, D. P. & Skilodimou, H. D. GIS-based landslide susceptibility mapping models applied to natural and urban planning in Trikala, Central Greece. Estudios Geol. 65(1), 49–65. https://doi.org/10.3989/egeol.08642.036 (2009).
Article Google Scholar
Chong, X. et al. Landslide hazard mapping using GIS and weight of evidence model in Qingshui River watershed of 2008 Wenchuan earthquake struck region. J. Earth Sci. 23(1), 97–120. https://doi.org/10.1007/s12583-012-0236-7 (2012).
Article Google Scholar
Huang, Y. & Zhao, L. Review on landslide susceptibility mapping using support vector machines. CATENA 165, 520–529. https://doi.org/10.1016/j.catena.2018.03.003 (2018).
Article Google Scholar
Peethambaran, B., Anbalagan, R., Shihabudheen, K. V. & Goswami, A. Robustness evaluation of fuzzy expert system and extreme learning machine for geographic information system-based landslide susceptibility zonation: A case study from Indian Himalaya. Environ. Earth Sci. 78(6), 231. https://doi.org/10.1007/s12665-019-8225-0 (2019).
Article Google Scholar
Sameen, M. I. et al. Landslide spatial modelling using unsupervised factor optimisation and regularised greedy forests. Comput. Geosci. 134, 104336. https://doi.org/10.1016/j.cageo.2019.104336 (2020).
Article Google Scholar
Achour, Y. & Pourghasemi, H. R. How do machine learning techniques help in increasing accuracy of landslide susceptibility maps?. Geosci. Front. 11(03), 153–165. https://doi.org/10.1016/j.gsf.2019.10.001 (2020).
Article Google Scholar
Burger, J. Environmental management: Integrating ecological evaluation, remediation, restoration, natural resource damage assessment and long-term stewardship on contaminated lands. Sci. Total Environ. 400(1–3), 6–19. https://doi.org/10.1016/j.scitotenv.2008.06.041 (2008).
Article ADS CAS PubMed PubMed Central Google Scholar
Kavoura, K. & Sabatakakis, N. Investigating landslide susceptibility procedures in Greece. Landslides 17(1), 127–145. https://doi.org/10.1007/s10346-019-01271-y (2020).
Article Google Scholar
Merghadi, A. et al. Machine learning methods for landslide susceptibility studies: A comparative overview of algorithm performance. Earth Sci. Rev. 207, 103225. https://doi.org/10.1016/j.earscirev.2020.103225 (2020).
Article Google Scholar
Othman, A. A., Gloaguen, R., Andreani, L. & Rahnama, M. Improving landslide susceptibility mapping using morphometric features in the Mawat area, Kurdistan Region, NE Iraq: Comparison of different statistical models. Geomorphology 319, 147–160. https://doi.org/10.1016/j.geomorph.2018.07.018 (2018).
Article ADS Google Scholar
Corominas, J. et al. Recommendations for the quantitative analysis of landslide risk. Bull. Eng. Geol. Environ. 73(2), 209–263. https://doi.org/10.1007/s10064-013-0538-8 (2014).
Article Google Scholar
Myronidis, D., Papageorgiou, C. & Theophanous, S. Landslide susceptibility mapping based on landslide history and analytic hierarchy process (AHP). Nat. Hazards 81(1), 245–263. https://doi.org/10.1007/s11069-015-2075-1 (2016).
Article Google Scholar
Ali, S. A. et al. GIS-based landslide susceptibility modeling: A comparison between fuzzy multi-criteria and machine learning algorithms. Geosci. Front. 12(2), 857–876. https://doi.org/10.1016/j.gsf.2020.09.004 (2021).
Article Google Scholar
Fell, R. et al. Guidelines for landslide susceptibility, hazard and risk zoning for land-use planning. Eng. Geol. 102(3), 99–111. https://doi.org/10.1016/j.enggeo.2008.03.014 (2008).
Article Google Scholar
Zhou, S., Zhang, Y., Tan, X. & Abbas, S. M. A comparative study of the bivariate, multivariate and machine-learning-based statistical models for landslide susceptibility mapping in a seismic-prone region in China. Arab. J. Geosci. 14(6), 440. https://doi.org/10.1007/s12517-021-06630-5 (2021).
Article Google Scholar
Qin, Y. et al. Performance evaluation of five GIS-based models for landslide susceptibility prediction and mapping: A case study of Kaiyang County, China. Sustainability 13, 11. https://doi.org/10.3390/su13116441 (2021).
Article Google Scholar
Song, R.-H., Hiromu, D., Kazutoki, A., Usio, K. & Sumio, M. Modeling the potential distribution of shallow-seated landslides using the weights of evidence method and a logistic regression model: A case study of the Sabae Area, Japan. Int. J. Sedim. Res. 23(2), 106–118. https://doi.org/10.1016/S1001-6279(08)60010-4 (2008).
Article Google Scholar
Tang, Y. et al. Integrating principal component analysis with statistically-based models for analysis of causal factors and landslide susceptibility mapping: A comparative study from the loess plateau area in Shanxi (China). J. Clean. Prod. 277, 124159. https://doi.org/10.1016/j.jclepro.2020.124159 (2020).
Article Google Scholar
Thiery, Y., Malet, J. P., Sterlacchini, S., Puissant, A. & Maquaire, O. Landslide susceptibility assessment by bivariate methods at large scales: Application to a complex mountainous environment. Geomorphology 92(1), 38–59. https://doi.org/10.1016/j.geomorph.2007.02.020 (2007).
Article ADS Google Scholar
Chen, W., Pourghasemi, H. R., Kornejady, A. & Zhang, N. Landslide spatial modeling: Introducing new ensembles of ANN, MaxEnt, and SVM machine learning techniques. Geoderma 305, 314–327. https://doi.org/10.1016/j.geoderma.2017.06.020 (2017).
Article ADS Google Scholar
Ou, P., Wu, W., Qin, Y., Zhou, X. & Liu, W. Assessment of landslide hazard in Jiangxi using geo-information. Front Earth Sci. China 9, 648342. https://doi.org/10.3389/feart.2021.648342 (2021).
Article Google Scholar
Zhang, Y. et al. Mapping landslide hazard risk using random forest algorithm in Guixi, Jiangxi, China. ISPRS Int. J. Geo-Inf. 9, 11. https://doi.org/10.3390/ijgi9110695 (2020).
Article Google Scholar
Zhou, X. et al. Zonation of landslide Susceptibility in Ruijin, Jiangxi, China. Int. J. Environ. Res. Public Health 18(11), 5906. https://doi.org/10.3390/ijerph18115906 (2021).
Article PubMed PubMed Central Google Scholar
Wu, W., Zucca, C., Karam, F. & Liu, G. Enhancing the performance of regional land cover mapping. Int. J. Appl. Earth Obs. Geoinf. 52, 422–432. https://doi.org/10.1016/j.jag.2016.07.014 (2016).
Article ADS CAS Google Scholar
Wu, W. et al. Soil salinity prediction and mapping by machine learning regression in Central Mesopotamia, Iraq. Land Degrad. Dev. 29, 4005–4014. https://doi.org/10.1002/ldr.3148 (2018).
Article Google Scholar
Guo, Z., Shi, Y., Huang, F., Fan, X. & Huang, J. Landslide susceptibility zonation method based on C5.0 decision tree and K-means cluster algorithms to improve the efficiency of risk management. Geosci. Front. 12(6), 101249. https://doi.org/10.1016/j.gsf.2021.101249 (2021).
Article Google Scholar
Dong, V. D. et al. A spatially explicit deep learning neural network model for the prediction of landslide susceptibility. CATENA 188, 104451. https://doi.org/10.1016/j.catena.2019.104451 (2020).
Article Google Scholar
Huang, F. et al. A deep learning algorithm using a fully connected sparse autoencoder neural network for landslide susceptibility prediction. Landslides 17, 217–229. https://doi.org/10.1007/s10346-019-01274-9 (2020).
Article Google Scholar
Pham, B. T. et al. Coupling RBF neural network with ensemble learning techniques for landslide susceptibility mapping. CATENA 195, 104805. https://doi.org/10.1016/j.catena.2020.104805 (2020).
Article Google Scholar
Zhu, L. et al. Landslide susceptibility prediction using sparse feature extraction and machine learning models based on GIS and remote sensing. IEEE Geosci. Remote Sens. Lett. https://doi.org/10.1109/LGRS.2021.3054029 (2021).
Article Google Scholar
Huangfu, W. et al. Landslide geo-hazard risk mapping using logistic regression modeling in Guixi, Jiangxi, China. Sustainability 13, 9. https://doi.org/10.3390/su13094830 (2021).
Article CAS Google Scholar
Huang, F. et al. Landslide susceptibility prediction based on a semi-supervised multiple-layer perceptron model. Landslides 17(12), 2919–2930. https://doi.org/10.1007/s10346-020-01473-9 (2020).
Article Google Scholar
Chen, W. et al. GIS-based landslide susceptibility evaluation using a novel hybrid integration approach of bivariate statistical based random forest method. CATENA 164, 135–149. https://doi.org/10.1016/j.catena.2018.01.012 (2018).
Article Google Scholar
Panahi, M., Gayen, A., Pourghasemi, H. R., Rezaie, F. & Lee, S. Spatial prediction of landslide susceptibility using hybrid support vector regression (SVR) and the adaptive neuro-fuzzy inference system (ANFIS) with various metaheuristic algorithms. Sci. Total Environ. 741, 139937. https://doi.org/10.1016/j.scitotenv.2020.139937 (2020).
Article ADS CAS PubMed Google Scholar
Pham, B. T. et al. GIS-based ensemble soft computing models for landslide susceptibility mapping. Adv. Sp. Res. 66(6), 1303–1320. https://doi.org/10.1016/j.asr.2020.05.016 (2020).
Article ADS Google Scholar
Zhu, L. et al. Landslide susceptibility prediction modeling based on remote sensing and a novel deep learning algorithm of a cascade-parallel recurrent neural network. Sensors 20, 1576. https://doi.org/10.3390/s20061576 (2020).
Article ADS PubMed Central Google Scholar
Zhang, T.-Y., Mao, Z.-A. & Wang, T. GIS-based evaluation of landslide susceptibility using a novel hybrid computational intelligence model on different mapping units. J. Mt. Sci. 17(12), 2929–2941. https://doi.org/10.1007/s11629-020-6393-8 (2020).
Article Google Scholar
Li, W. et al. Uncertainties analysis of collapse susceptibility prediction based on remote sensing and GIS: Influences of different data-based models and connections between collapses and environmental factors. Remote Sens. 12, 24. https://doi.org/10.3390/rs12244134 (2020).
Article CAS Google Scholar
Guzzetti, F., Reichenbach, P., Cardinali, M., Galli, M. & Ardizzone, F. Probabilistic landslide hazard assessment at the basin scale. Geomorphology 72(1–4), 272–299. https://doi.org/10.1016/j.geomorph.2005.06.002 (2005).
Article ADS Google Scholar
Chavez, P. S. Image-based atmospheric correction-revisited and improved. Photogramm. Eng. Remote. Sens. 62(9), 1025–1036. https://doi.org/10.1016/0031-0182(96)00019-3 (1996).
Article Google Scholar
Wu, W. (2003). Application de la geomatique au suivi de la dynamique environnementale en zones arides. Université Panthéon-Sorbonne-Paris I.
Wu, W., De Pauw, E. & Hellden, U. Assessing woody biomass in African tropical savannas by multiscale remote sensing. Int. J. Remote Sens. 34(13), 4525–4549. https://doi.org/10.1080/01431161.2013.777487 (2013).
Article Google Scholar
Bonham-Carter, G., Agterberg, F. & Wright, D. Weights of evidence modelling: A new approach to mapping mineral potential. Stat. Appl. Earth Sci. Geol. Surv. Can. Pap. 89–9, 171–183 (1989).
Google Scholar
Westen, C., Rengers, N. & Soeters, R. Use of geomorphological information in indirect landslide susceptibility assessment. Nat. Hazards 30(3), 399–419. https://doi.org/10.1023/B:NHAZ.0000007097.42735.9e (2003).
Article Google Scholar
Bonham-Carter, G. F. & Agterberg, F. P. Arc-WofE: A GIS tool for statistical integration of mineral exploration datasets. Bull. Int. Stat. Inst. 58(2), 497–500 (1999).
Google Scholar
Waske, B. et al. imageRF—a user-oriented implementation for remote sensing image analysis with random forests. Environ. Model. Softw. 35(1), 192–193. https://doi.org/10.1016/j.envsoft.2012.01.014 (2012).
Article Google Scholar
Li, Y.-F., Xie, M. & Goh, T.-N. Adaptive ridge regression system for software cost estimating on multi-collinear datasets. J. Syst. Softw. 83(11), 2332–2343. https://doi.org/10.1016/j.jss.2010.07.032 (2010).
Article Google Scholar
Ayalew, L. & Yamagishi, H. The application of GIS-based logistic regression for landslide susceptibility mapping in the Kakuda-Yahiko Mountains, Central Japan. Geomorphology 65(1–2), 15–31. https://doi.org/10.1016/j.geomorph.2004.06.010 (2005).
Article ADS Google Scholar
Cao, J., Zhang, Z., Wang, C., Liu, J. & Zhang, L. Susceptibility assessment of landslides triggered by earthquakes in the Western Sichuan Plateau. CATENA 175, 63–76. https://doi.org/10.1016/j.catena.2018.12.013 (2019).
Article Google Scholar
Budimir, M. E. A., Atkinson, P. M. & Lewis, H. G. A systematic review of landslide probability mapping using logistic regression. Landslides 12(3), 419–436. https://doi.org/10.1007/s10346-014-0550-5 (2015).
Article Google Scholar
Vapnik, V. & Lerner, A. Pattern recognition using generalized portrait method. Autom. Remote. Control. 24, 774–780 (1963).
Google Scholar
Vapnik, V. The Nature of Statistical Learning Theory (Springer, 2000). https://doi.org/10.1007/978-1-4757-3264-1 (978-1-4757-3264-1).
Book MATH Google Scholar
Breiman, L. Random forests. Mach. Learn. 45(1), 5–32 (2001).
Article Google Scholar
Breiman, L., Friedman, J. H., Olshen, R. A. & Stone, C. J. Classification and regression trees (CART). Biometrics 40(3), 582–588. https://doi.org/10.2307/2530946 (1984).
Article MATH Google Scholar
Dou, J. et al. Assessment of advanced random forest and decision tree algorithms for modeling rainfall-induced landslide susceptibility in the Izu-Oshima Volcanic Island, Japan. Sci. Total Environ. 662, 332–346. https://doi.org/10.1016/j.scitotenv.2019.01.221 (2019).
Article ADS CAS PubMed Google Scholar
Huang, F. et al. Uncertainty study of landslide susceptibility prediction considering the different attribute interval numbers of environmental factors and different data-based models. CATENA 202, 105250. https://doi.org/10.1016/j.catena.2021.105250 (2021).
Article Google Scholar
Chen, W. et al. Landslide susceptibility modelling using GIS-based machine learning techniques for Chongren County, Jiangxi Province, China. Sci. Total Environ. 626(2018), 1121–1135. https://doi.org/10.1016/j.scitotenv.2018.01.124 (2018).
Article ADS CAS PubMed Google Scholar
Depicker, A. et al. The added value of a regional landslide susceptibility assessment: The western branch of the East African Rift. Geomorphology 353, 106886. https://doi.org/10.1016/j.geomorph.2019.106886 (2020).
Article Google Scholar
Hong, H. et al. Rainfall-induced landslide susceptibility assessment at the Chongren area (China) using frequency ratio, certainty factor, and index of entropy. Geocarto Int. 32, 139–154. https://doi.org/10.1080/10106049.2015.1130086 (2016).
Article Google Scholar
Chen, W. et al. Novel hybrid artificial intelligence approach of bivariate statistical-methods-based kernel logistic regression classifier for landslide susceptibility modeling. Bull. Eng. Geol. Environ. 78, 4397–4419. https://doi.org/10.1007/s10064-018-1401-8 (2019).
Article Google Scholar

Download references

Acknowledgements

A sincere gratitude shall be sent to the 264 Geological Brigade of the Jiangxi Nuclear Industry for their provision of the field data and geological map. Landsat imagery was obtained from the USGS data server (https://glovis.usgs.gov) and DEM data (ASTGTMV003 30m) from NASA (www.earthdata.nasa.gov); Google is acknowledged for making the very high resolution images available on Google Earth. This study was financially supported by the Start-up Funding for Scientific Research of the East China University of Technology (Grant No. DHTP2018001) and by the Jiangxi Talent Program (Grant No. 900/2120800004) to Dr Weicheng Wu, and also by the Special Innovation Fund for Postgraduate of the East China University of Technology to Ms Xiaoting Zhou (Grant No. YC2020-B158).

Author information

Authors and Affiliations

Key Laboratory of Digital Lands and Resources and Faculty of Earth Sciences, East China University of Technology, Nanchang, 330013, Jiangxi, China
Xiaoting Zhou, Weicheng Wu, Yaozu Qin & Xiao Fu

Authors

Xiaoting Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Weicheng Wu
View author publications
You can also search for this author in PubMed Google Scholar
Yaozu Qin
View author publications
You can also search for this author in PubMed Google Scholar
Xiao Fu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceptualization, X.Z. and W.W.; methodology, W.W. and X.Z.; software, W.W. and Y.Q.; validation, X.Z. and X.F.; investigation, X.Z., W.W., and Y.Q.; writing—original draft preparation, X.Z. and W.W.; writing—review and editing, W.W.; visualization, X.F.; supervision, W.W. and Y.Q.; and funding acquisition, W.W. All authors have read and agreed to the published version of the manuscript.

Corresponding author

Correspondence to Weicheng Wu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Zhou, X., Wu, W., Qin, Y. et al. Geoinformation-based landslide susceptibility mapping in subtropical area. Sci Rep 11, 24325 (2021). https://doi.org/10.1038/s41598-021-03743-5

Download citation

Received: 28 August 2021
Accepted: 09 December 2021
Published: 21 December 2021
DOI: https://doi.org/10.1038/s41598-021-03743-5

This article is cited by

Research on the influence of different sampling resolution and spatial resolution in sampling strategy on landslide susceptibility mapping results
- Xianyu Yu
- Huihui Chen
Scientific Reports (2024)
Seismic landslide susceptibility assessment using principal component analysis and support vector machine
- Ziyao Xu
- Ailan Che
- Hanxu Zhou
Scientific Reports (2024)
Evaluating landslide susceptibility: an AHP method-based approach enhanced with optimized random forest modeling
- Xuedong Zhang
- Haoyun Xie
- Bo Chen
Natural Hazards (2024)
A comparative evaluation of landslide susceptibility mapping using machine learning-based methods in Bogor area of Indonesia
- Dian Nuraini Melati
- Raditya Panji Umbara
- Maria Susan Anggreainy
Environmental Earth Sciences (2024)
Geoinformatics-based frequency ratio, analytic hierarchy process and hybrid models for landslide susceptibility zonation in Kurdistan Region, Northern Iraq
- Kaiwan K. Fatah
- Yaseen T. Mustafa
- Imaddadin O. Hassan
Environment, Development and Sustainability (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Data and methodology

Study area

Field observation data

Geo-environmental factors

Preparation

Optimal discretization of the continuous factors

WoE-based processing of geo-environmental factors

Machine learning modeling

LR modeling

SVM modeling

RF modeling

Model performance assessment

Results

Collinearity of the geo-environmental factors

Hybrid models

WoE-based LR models

WoE-based SVM model

WoE-based RF model

Landslide susceptibility maps (LSM)

Comparison of the LSMs

Discussion

Advantages of the hybrid modeling

Comparison with other researches

Conclusions

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links