Introduction

In the natural forest environment of northeast China, down wood (DW) is often caused by natural factors, which are generally divided into biological and abiotic factors1. Biological factors are caused by the age of trees, unbalanced growth, consumption of animals, diseases, etc2. The wind, fire, lightning, environmental stress (floods, droughts, and high temperatures), chemical pollution, and climate change are the main abiotic factors3,4. Liangshui National Nature Reserve is located in Xiaoxing’an mountains; the wind is considered the main reason for a mass of DW, which can cause the breakage or uprooting of living trees. The loss of living trees will have a serious impact on the economy5,6,7. In addition, the ecological significance of DW cannot be ignored8,9. DW plays key roles in resource cycling10, maintaining biodiversity11,12,13 and changing the micro-environment of stand7,14,15. Hence, it is distinctly indispensable to predict the spatial distribution of occurrence and number of DW in natural forests.

At present, many scholars have carried out a great deal of work on the DW16, among which, it shows more advantages of building theoretical models for the sake of researching spatial distribution of DW, geographical environment, stand characteristics, and other driving factors. Generalized linear (GL) models are widely applied in predicting the probability occurrence and mathematical statistics of events17,18. Among them, logistic regression19,20 is widely applied in predict the occurrence of DW (ODW), and Poisson regression21 is widely applied in predict the number of DW (NDW) caused by forest ecological environment attributes. However, GL models assume that the global space is stable, but it is difficult to find the data that satisfies this condition in forestry and the ecological environment. The space of Tobler says, “Distance is an important factor in whether or not things are related22,23.” This fact suggests that tree growth and stand development are highly likely to be influenced by spatial effects (spatial non-stationarity) of adjacent stands and that many ecological processes (such as DW) follow similar rules in space24. Therefore, British scholars Brunsdon and Fotheringham25 based on the spatial coefficient of variation, the geographically weighted regression (GWR) model is proposed25. This model method has been applied to various industries and has good results26,27. The fitting result of GWR model is related to the choice of bandwidth and weighting functions28,29.

GWR can well support the analysis of spatial non-stationarity in a dynamic environment, but complex geographic processes are difficult to model30,31. First, GWR cannot select the most accurate kernel function and determine the correct kernel function through the cross-validation method, and it is unable to estimate the spatial non-stationarity in a complex geographical environment accurately32. Second, the form of GWR kernel function needs to satisfy the conditions of a priori hypothesis. Third, the relative simplicity of the kernel function can only solve simple nonlinear problems but cannot solve complex nonlinear interactions in the geographical environment33,34. For this reason, Du35 proposed the use of the spatially weighted neural network (SWNN) to replace the kernel matric for the spatial coefficient of variation construction to perform and put forward a GNNWR model to solve complex non-stationary and nonlinear problems in coastal environments. Further, the instability of the forest ecosystem in the process of the ecological environment poses a similar issue. In this paper, we study SWNN combined with logistic regression and Poisson regression models, respectively, a geographically neural-network-weighted logistic regression (GNNWLR) model and a geographically neural-network-weighted Poisson regression (GNNWPR) model are introduced.

We tend to think about the Liangshui National Nature Reserve as the study area and established the relationship between a series of terrain and stand variables on the ODW (binary) and NDW (count). They include (1) logistic regression (GL, GWLR, and GNNWLR) models and Poisson regression (GL, GWPR, and GNNWPR) models to terrain and stand variables, respectively, in order to predict the ODW and NDW; (2) compare the performance of different models fitting, predicting, and residual Moran’s I; (3) the spatial distribution law of model coefficients in different geographical locations was intuitively displayed and analyzed by the GIS technology.

Materials

Study area

Liangshui National Nature Reserve is affiliated with the Northeast Forestry University, and the study area is located in Yichun City of northeast China (Fig. 1). It belongs to the southeast section of the Xiaoxing’an mountains range, the eastern slope of the Daridailing branch, with a gross area is 12,133 ha, the core area of 6394 ha is the study area. There are many primitive Pinus koraiensis preserved in China and secondary birch and broad-leaved forests covering different succession stages of mixed broad-leaved primitive Pinus koraiensis forests. Other major tree species includes Spruce, Fir, Juglans mandshurica, Betula platyphylla, etc.

Figure 1
figure 1

Study area (Liangshui National Nature Reserve). ArcGIS10.4 was used to draw the maps.

The data comes from the 2019–2020 Forest Resources Planning and Design Survey. There are 32 compartments and 464 sub-compartments; 31 compartments and 443 sub-compartments belong to the arbor forest land. We have also carried out down deadwood volume (including down wood, standing deadwood, and others) surveys on the plots36. In order to further manage the area, according to the actual situation, we investigated the occurrence and number of valuable down wood through corner gauge points to sub-compartments in this research. Meanwhile, the tree species, number of living trees (NLT), forest stand mean height (H), and diameter at breast height (DBH) in sub-compartments were summarized by using an angle gauge. Moreover, terrain factors, such as DEM (m), slope (°), aspect and stand factors, mean age of living trees (years), canopy, vegetation coverage (%), etc., are also recorded here.

Variable selection

This study aims to utilize terrain and stand variables to introduce independent and dependent variable models. The independent variables are ODW (binary) and the NDW (count). The final retention was determined by using stepwise regression (significance level \(\alpha = 0.05\))37,38 and following five dependent variables: the number of living trees (NLT), canopy, forest stand mean height (H), slope, and forest stand mean DBH (DBH). Table 1 describes statistical variables related to statistical techniques.

Table 1 Description of the basic statistics of dependent variables and independent variables.

In order to study the non-stationarity in each direction, the variation of each variable along the longitude and latitude is calculated in Fig. 2. Considering the NDW, the incidence of the DW under different longitudes, latitudes, and the affect DW variables under any latitudes and longitudes, the following results are observed: the probability of more than half, the NDW is observed commonly in 0–25 n/ha, with an increase in the longitude the phenomenon of the DW appears for more than 50 n/ha, and DW more frequently in middle latitudes in the region, but cannot clear the trend of change. Moreover, it can be seen from the figure that the variation trends of the longitude and latitude of the other five independent variables are also inconsistent. Further, the longitude of the slope gradually increases and along the latitude of slope gradually decreases and then increases, its value is the lowest in the mid-latitude region. In conclusion, the variation of each variable significantly changes along the longitude and latitude, which further indicates that the stand environment shows high spatial non-stationarity.

Figure 2
figure 2

The trend of each variable along the longitude and latitude.

Methods

GL models

In many practical cases, the response variable is essentially binary39,40. That is, there are two possibilities in the result, namely, which are assigned a value of 0 (does not happen) or 1 (does happen). Here, we set ODW (y = 1) as P, and no ODW (y = 0) as 1−P. Logistic regression between the occurrence probability of the DW and their respective variables was established as shown in Eq. (1)20,41:

$$Log{\text{it}}(P) = \ln (P/(1{ - }P)) = \beta_{0} + \sum\nolimits_{k = 1}^{5} {\beta_{k} x_{ik} }$$
(1)

where P represents the probability of the ODW and \(\beta_{0} \sim \beta_{k}\) represents the regression coefficient of the model. k is the number of independent variables.

The Poisson model is usually attributed to count data. Here, the Poisson model can be utilized to predict the NDW, in line with the Poisson variable \(Y\) and the Poisson probability density distribution function \(f(Y,\mu ) = e^{ - \mu } \mu^{Y} /Y!\), where \(\lambda\) is the mathematical expectation and variance of the random variable \(Y\), namely, \(E(Y) = \lambda\) and \(V{\text{ar}}(Y) = \lambda\); a monotonous average link function of response variables as a linear model is obtained by inducing some changes as shown in Eq. (2)21:

$$Log(E(Y)) = L{\text{o}}g\mu = \beta_{0} + \sum\nolimits_{k = 1}^{5} {\beta_{k} } x_{ik}$$
(2)

It is assumed that the observed values are independent of each other, where \(\mu\) represents the NDW and \(\beta_{0} \sim \beta_{k}\) represents the regression coefficient of the model. k is the number of independent variables. All models estimate \(\beta_{0} \sim \beta_{k}\) by the maximum likelihood method.

GWR models

However, the above GL (logistic and Poisson) models are global in nature. The data collected in different geographical locations shows completely different results in the actual forestry survey due to the interference of different geographical environments and stand factors42. In the GL models analysis, it is often assumed that the estimated value of model coefficients are independent of the geographic location of the collected data, which leads to the estimated results tending to an average value. Thus, it can be inferred that all sample locations are based on unbiased estimates43. Therefore, the GL models show certain limitations in their application35. At present, GWR models are commonly used for improving the non-stationarity of space44,45.

Among them, we use the geographically weighted logistic regression (GWLR) model46,47 to forecast the ODW. Eq. (3) is expressed as follows:

$$Log{\text{it}}(P(u_{i} ,v_{i} )) = \ln (P(u_{i} ,v_{i} )/(1{ - }P(u_{i} ,v_{i} ))) = \beta_{0} (u_{i} ,v_{i} ) + \sum\nolimits_{k = 1}^{5} {\beta_{k} (u_{i} ,v_{i} )X_{ik} }$$
(3)

where \(\beta_{0} (u_{i} ,v_{i} )\sim \beta_{k} (u_{i} ,v_{i} )\) represents the coefficient of GWLR at the position \(i\).

The geographically weighted Poisson regression (GWPR) model48 to forecast the NDW. Eq. (4) is expressed as follows:

$$\begin{gathered} Log(E(Y(u_{i} ,v_{i} ))) = L{\text{o}}g\mu (u_{i} ,v_{i} ) = \beta_{0} (u_{i} ,v_{i} ) + \sum\nolimits_{k = 1}^{5} {\beta_{k} (u_{i} ,v_{i} )X_{ik} } \hfill \\ \hfill \\ \end{gathered}$$
(4)

where \(\beta_{0} (u_{i} ,v_{i} )\sim \beta_{k} (u_{i} ,v_{i} )\) represents the coefficient of GWPR at the position \(i\). All models estimate \(\beta_{0} \sim \beta_{k}\) by the maximum likelihood method.

GNNWR models

However, the kernel function of GWR models is relatively simple, and it is not easy to accurately model the complex stand geographical environment. For this reason, we propose a geographically neural-network-weighted regression (GNNWR) model, which is similar to the GWR models and uses the form of neural networks for defining the spatial non-stationary relationship35. Here, we integrate GWR into GL models, and the GNNWLR and GNNWPR models are shown in Eqs. (5) and (6):

$$\begin{gathered} Log{\text{it}}(P \times w(u_{i} ,v_{i} )) = \ln (P \times w(u_{i} ,v_{i} )/(1{ - }P \times w(u_{i} ,v_{i} ))) \hfill \\ \begin{array}{*{20}c} {\begin{array}{*{20}c} {\begin{array}{*{20}c} {} & {} \\ \end{array} } & {} \\ \end{array} } & {} & {} & {} \\ \end{array} = \beta_{0} \times w_{0} (u_{i} ,v_{i} ) + \sum\nolimits_{k = 1}^{5} {\beta_{k} \times w_{k} (u_{i} ,v_{i} )X_{ik} } \hfill \\ \end{gathered}$$
(5)

where \(w_{0} (u_{i} ,v_{i} )\sim w_{k} (u_{i} ,v_{i} )\) are the weight estimated at \(\beta_{0} \sim \beta_{k}\) by using the corresponding logistic regression by the maximum likelihood method.

$$\begin{gathered} Log(E(Y \times w(u_{i} ,v_{i} ))) = L{\text{o}}g(\mu \times w(u_{i} ,v_{i} )) \hfill \\ \begin{array}{*{20}c} {\begin{array}{*{20}c} {\begin{array}{*{20}c} {\begin{array}{*{20}c} {} & {} \\ \end{array} } & {} \\ \end{array} } & {} \\ \end{array} } & {} & {} & {} \\ \end{array} = \beta_{0} \times w_{0} (u_{i} ,v_{i} ) + \sum\nolimits_{k = 1}^{5} {\beta_{k} \times w_{k} (u_{i} ,v_{i} )X_{ik} } \hfill \\ \end{gathered}$$
(6)

where \(w_{0} (u_{i} ,v_{i} )\sim w_{k} (u_{i} ,v_{i} )\) are the weight estimated at \(\beta_{0} \sim \beta_{k}\) by using the corresponding Poisson regression by the maximum likelihood method.

The spatial weight \(w(u_{i} ,v_{i} )\) is expressed as shown in Eq. (7):

$$w(u_{i} ,v_{i} ) = \left[ {\begin{array}{*{20}c} {w_{0} (u_{i} ,v_{i} )} & 0 & 0 & 0 \\ 0 & {w_{1} (u_{i} ,v_{i} )} & 0 & 0 \\ 0 & 0 & {...} & 0 \\ 0 & 0 & 0 & {w_{5} (u_{i} ,v_{i} )} \\ \end{array} } \right]$$
(7)

In the GWR models, the spatial weights include the Gaussian function, double square function, etc., but their structure is relatively simple, so it is difficult to capture the complex relationship between the spatial distance and the non-stationary weight. Here, the calculation of the SWNN can be deemed to a complex nonlinear problem between the spatial distance and the weight33,34. The SWNN is utilized to develop the non-stationary weight matrix, and kernel weight is determined as expressed in Eq. (8):

$$w(u_{i} ,v_{i} ) = SWNN\left( {\left[ {d_{i1}^{S} ,d_{i2}^{S} , \cdots ,d_{in}^{S} } \right]^{T} } \right)$$
(8)

where \(\left[ {d_{i1}^{S} ,d_{i2}^{S} , \cdots ,d_{in}^{S} } \right]\) is the distance between point \(i\) and other samples.

The neural network is designed using the Keras deep learning framework, also known as the deep neural network49,50. By setting the initial learning rate, the information between adjacent layers is considered fully connected (FC) and passed to the hidden layers51,52,53 and the dropout algorithm is required to be used in the iterative process of the training models54. Here, we also set the batch size as the sample number of each training, defined as LeakyReLU55 a nonlinear activation function for each network layer. The expressions for each layer are as expressed in Eq. (9).

$$y_{l} = \sigma (w_{l}^{t} x_{l} + b_{l} )$$
(9)

where \(l\) is the number of all the layers in the training, \(x_{l}\) are features of input layers, \(w_{l}^{t}\) is the weight matrix, \(b_{1}\) is the offset parameter vector, \(y_{l}\) are features of output layers, and \(\sigma\) is the activation function.

When the training process encounters the triggering condition of early stop or the time of epoch reaches the set maximum, the training is stopped. Once the training process is completed, the prescient capacity of the model is estimated by utilizing a validation set. In this paper, Huber56 was used as the training loss function of the model, and the mean absolute error (MAE) of the validation set was used as the over-fitting evaluation index. By setting the maximum epoch, the trend of the MAE of the training set and the validation set was analyzed to find the optimal model parameters under the optimal number of iterations. To this end, its overall design framework of GNNWR (GNNWLR and GNNWPR) is shown in Fig. 3. Hyper-parameter settings of GNNWLR and GNNWPR models are shown in Table 2.

Figure 3
figure 3

GNNWR (GNNWLR and GNNWPR) with the estimation of the design framework (a) MAE of epoch for GNNWLR (b) and GNNWPR (c).

Table 2 Hyper-parameter settings of GNNWLR and GNNWPR.

Model assessment

The model's performance is assessed by utilizing the coefficient of determination (R2), which is utilized to assess the variability of the estimates, the standard deviation of the root-mean-square error (RMSE), and MAE estimates of the prediction error. The accuracy (acc) of 0 and 1 prediction under different logistic models was also analyzed. Moreover, the revised Akaike information criterion (AICc) is utilized to select the best bandwidth of the GWR models. The residual of the model is defined as the difference between the observed values and the predicted values, and the spatial autocorrelation of each model residuals was analyzed by using correlation graphs of global Moran’s I coefficients across a lag distance36,57,58,59.

Results

Model assessment

A total of 355 samples were randomly selected from 443 samples of the study area for model fitting and 88 samples for model validation. The standardized independent variables are NLT, canopy, H, slope, and DBH. The accuracy results are shown in Table 3. It can be seen that for both the training set and the validation set, the GNNWR (GNNWLR and GNNWPR) models are superior than GWR (GWLR and GWPR) models, and the GWR models are better than GL (Logistic and Poisson) models.

Table 3 Model accuracy verification statistics.

In order to verify that the spatial effects of GNNWR models include spatial non-stationarity, comparing spatial effect processes of different models can reduce the ability to misleading significance testing and prediction models. Moran’s I and Z-value are calculated in Table 4. In order to better compare the spatial relationship of residuals of different models, spatial correlation graphs of residuals of different models are drawn with an interval of 300 m. Their average distance is about 300 m. Its neighbor pairs are 302, 1318, 2150, 2631, 3293 and 3751, respectively (Fig. 4).

Table 4 Moran’s I and Z-value for predicting ODW and NDW model residuals.
Figure 4
figure 4

The spatial correlation between the residuals of ODW (a) and NDW (b) models.

From Table 4, it can be seen that GL and GWPR models (Z-value > Z \(\alpha /2\) = 1.96) indicate that the independent assumption of model residuals is contrary to these two types of models, and these results indicate a similar clustering pattern. In general, the ability to eliminate the autocorrelation of spatial residuals can be gleaned from the absolute value of Z-value, and it follows the order: GNNWR models > GWR models > GL models. It indicates that GNNWR models can effectively eliminate spatial non-stationarity. Moran’s I of GNNWR models are also relatively stable at any step size and close to 0, which indicates that GNNWR models demonstrate a good ability to maintain spatial stability.

Model parameter analysis

According to the geographical location, the two types of models, GWR and GNNWR models, belong to local models and produce different geographical model coefficients. The descriptive statistics of the GWR and GNNWR model coefficients are summarized in Table 5. The distribution of GWR and GNNWR coefficients can be used to the non-stationary relationship between dependent and independent variables of stand environmental parameters33,34. The GWR and GNNWR models' coefficients show positive and negative fluctuations, indicating that the influence of the stand environment on DW may show opposite effects in different locations. Furthermore, the variation of the GNNWR coefficient is more dramatic than that of the response GWR coefficients, which may be the reason why the fitting and prediction performance of GNNWR has dropped significantly.

Table 5 Basic statistic parameters of GWR and GNNWR models.

Visualized analysis of model parameters

The visual distribution analysis of five predictive variables (NLT, canopy, H, slope, and DBH) is shown in Fig. 5a–e and according to the compartments. For further study the spatial correlation between ODW and NDW, the distribution diagrams of the model coefficients of the five predictor variables of the GNNWLR (Fig. 5f–j) and GNNWPR (Fig. 5k–o) models are presented. For the convenience of analysis, we divided the study area into nine orientations ((Fig. 5p). In terms of the NLT (Fig. 5a), there are fewer NLT in the N and S areas and more in the M area. Canopy (> 0.7) (Fig. 5b) is mainly distributed in the S and N areas; H (Fig. 5c) is higher in the N and SE. There are many rivers and roads in M area of Liangshui Nature Reserve. The slope in this area is slow (< 11°) (Fig. 5d). The DBH (Fig. 5e) of big living trees is > 35 cm and mainly clustered in N, W, and E areas. Therefore, it can be seen that in the N and SE areas, there are trees with less NLT and big DBH. The M area has medium-sized trees with a gentle slope. In the S area, there are more small trees with more NLT. Their slope is steep in the N, NE, E, and SE areas. The coefficients vary significantly in space and show several directional patterns, and the two symbols and sizes are often heterogeneous, which also explains the significant spatial non-stationarity of the DW space.

Figure 5
figure 5

Spatial distribution of the 5 predictive variables NLT, canopy, H, slope, and DBH (ae). Coefficient estimation of 5 predictive variables in GNNWLR (fj) and GNNWPR (ko) models. (p) is the divide the study area into 9 orientations. ArcGIS10.4 was used to draw the maps.

Discussion

As the study area of Liangshui National Nature Reserve, there is a large DW affected by the wind storms, especially most of the trees are old primitive mixed broadleaf-conifer forest. The trees that have passed the mature stage become more and more vulnerable in the face of wind as the H taller, and the canopy grows larger, the physiological aging and the ability to resist diseases and insect pests decrease, and most of them end their life cycle in the state of breakage or uprooting60,61 and such disaster changed its native forest stand structure62, which is associated with the hardwood species changing into the broad-leaved mixed forest and shows the positive pioneer trees, such as oak, birch, and aspen63. This indicates the formation of the secondary forest. Not only that, but it also leads to the loss of a lot of valuable timber in this area64. Therefore, it shows the economic value to the region (not rotten wood), and buck and forest investigate the environmental factors by using the data of the ODW (binary)19,20 and the NDW (count)21 to establish the GL, GWR, and GNNWR models, respectively. To better understand spatial non-stationarity caused by DW, the terrain with obvious gradient change and stand environment are used in the study area.

Here, we compare the three methods and find that the non-stationary matrix constructed by the SWNN can well solve the complex nonlinear problem between the weight space distance and the weight space and thus can better predict the spatial change of the DW33,34. This method also has good results to solve complex non-stationary and nonlinear problems in coastal environments33,34,35. It is found that the GNNWR model shows absolute advantages in both the training set and the validation set 35. At the same time, we utilized Moran’s I and Z-value to check the non-stationarity of model space and observed that GL and GWR models would have independent assumptions and inefficient model coefficient estimates. In addition, according to the spatial residual correlation graph drawn at every 300 m, the Moran’s I of GNNWR models at any step size is relatively stable and close to 0, which indicates that GNNWR models show a very good ability to maintain spatial stability65. At this moment, drop lines (logistic regression) and point plot (Poisson regression) with the ground survey data, respectively, and draw its 1:1 linear fitting diagram. The results of GNNWR models are closest to the ground-truth data (Fig. 6).

Figure 6
figure 6

Correlation logistic and Poisson (GL models (a, d), GWR models (b, e), and GNNWR models (c, f)) and ground-truth data of comparative analysis.

The distribution of ODW and NDW are easily affected by the climate, stand, terrain, and tree factors61. For the investigation, variables in the Liangshui Nature Reserve, four stand variables (NLT, canopy, H, and DBH), and one terrain variable (slope)66 were selected by stepwise regression. NLT represents the density of the stand, and the canopy represents the size of the canopy structure. The DBH and H represent the tree size67,68,69; the slope represents the geographical characteristics of stands66.

In GNNWR models, the symbols and sizes of the model coefficients are often heterogeneous; that is, different stand environments show different or even opposite effects on the distribution of the DW33,34. Here, we analyze the 5 variables of GNNWLR and GNNWPR models and the 9 orientations of model coefficients(ODW and NDW), and we can judge whether they are positively or negatively correlated with the corresponding dependent variables from the positive and negative mean values of variable model coefficients. However, we can see from the range of the box that these are not the only relations (Fig. 7), which will vary according to different regions, which also shows the existence of non-stationarity in the spatial model35. In the E and W orientations, the ODW is high and the NDW is large64. In the N, NE and NW orientations, the ODW is high, but NDW not large. In the region of N and NE, NLT is less, but the trees size and the tree canopy are large and the slope is steep. However, in the NW area, trees are of medium size and more NLT. However, SE, S and SW orientations are not easy to have DW, and the NDW is small. The tree size is small, the canopy is big, NLT is medium, the slope is gentle. In the M orientation, there are certain ODW and NDW63.

Figure 7
figure 7

The statistical analysis of 5 normalized variables and coefficients of ODW and NDW in 9 orientations under GNNWR models.

Finally, the predicted ODW and NDW on the visualization distribution maps of GNNWR closest to the ground-truth results, combined with statistical graphics and GIS mapping ability70, can provide inverted wooden key visual information, and forest managers can distinguish areas that are affected by disaster in order to help the administrator to provide timber and forest terrain variables and the relationship between the detailed information. Therefore, GNNWLR models are used to evaluate the ODW in a given area or small class, and GNNWPR models are used to predict the NDW. In order to prevent wind damage in the future, stand density, structure, and species composition can be changed in local areas63, which can assist in decision making and management planning for rational afforestation and management activities in the natural forest ecosystem.

Conclusion

Logistic regression (GL, GWLR, and GNNWLR) models and Poisson regression (GL, GWPR, and GNNWPR) models were used to model terrain variables and stand variables, respectively, for predicting the ODW and NDW. The analysis indicated that GNNWR models offer greater advantages than GWR and GL models in model-fitting and validation performance and also produced ideal residuals to validate spatial correlation. In addition, the GIS technology provides much useful information in the study area about different ODW and NDW caused by spatial non-stationarity to stand and terrain factors. For a given area, the spatial distribution information of the DW can be clearly identified, which can be considered a good approach to assess the damage caused by natural disasters, provide key information for forest resources, assist in decision making and management schemes, and avoid and reduce the disturbance and loss caused by natural disasters.