Introduction

Forests are the largest C repository in terrestrial ecosystems1. Comprehensively improving the stability of forest ecosystems and ecological service functions, and increasing forest utilization are effective ways to address current climate change. However, the problem of difficult use of forestland in dry and barren mountainous has become increasingly prominent. Severe rocky desertification2, and insufficient nutrient and water supplies have slowed plant growth and development3,4. Therefore, understanding the soil environment in mountainous areas will help reveal the impact of soil quality on sustainable forest management.

Soil is an important environment for regulating nutrient balance and plant growth and development in forest ecosystems5, and different forest types have different soil environments6. Soil quality is a comprehensive reflection of soil physical, chemical and biological characteristic7. Soil moisture and nutrients are crucial in the energy flow and transmission of the ecosystem, and they reflect the status of soil quality8. Soil extracellular enzymes are derived from soil microbial metabolic activities, plant root secretion and animal residue decomposition9. Extracellular enzyme activities can reflect the functional characteristics of soil microbes and participate in C, N and P. absorption and utilization in soil biochemical reactions10. The soil C-acquiring enzyme β-1,4-glucosidase(BG) can be used to catalyse the C cycle, the soil N-acquiring enzymes β-1,4-N-acetylglucosaminidase (NAG) and L-leucine aminopeptidase (LAP) are responsible for peptidoglycan and leucine decomposition, and the soil P-acquiring enzyme acid phosphatase (AP) can catalyse organophosphorus chemical mineralisation11. Previous studies have reported that the biological characteristics of forest soil quality are catalase activity (CAT), urease activity (UR) and protease activity (PR)12,13,14, but the four major acquisition enzymes of soil C, N and P and vector properties are rarely used for soil quality assessment. Therefore, the application of extracellular enzymes and vector properties is conducive to understanding the biological characteristics of the dry and barren mountainous ecosystem in the Yimeng Mountainous Area, China, and is critical in the impact of microbial metabolism on soil quality against the background of global climate change.

Soil quality cannot be measured directly, and a comprehensive assessment of soil indicators is needed to quantify soil quality15. Soil quality assessment methods include the soil quality index (SQI), soil productivity index model (PI), and system analysis method16,17,18. At present, the most used soil quality assessment method in agriculture and forestry is the SQI19,20, but there are many soil indicators involved, and it is difficult to obtain data on all soil properties, and information overlaps. Therefore, simplifying the evaluation indicators and constructing the minimum data set (MDS) for soil quality assessment21 could reduce the workload, reduce the cost, and prevent data redundancy22,23. Zhang et al.24 used the MDS to assess the soil quality at different stages of vegetation succession in China's karst areas, and the soil quality at the stage from farmland to secondary forest showed an increasing trend. Zhao et al.25 used the minimum data set to assess the different ages of Larix principis-rupprechtii, and the soil quality gradually improved within 16 to 44 years. Therefore, it is essential to establish an MDS to assess the soil quality of forest communities in a specific area.

We researched the soil parameters, and four physical properties, six chemical properties, and six biological properties were determined in five communities, combined with principal components and correlation analysis to establish a minimum data set. We compared linear and non-linear models for soil quality assessment and analysed the factors influencing soil quality in different communities of dry and barren mountains in the state-owned Dawa Forest Farm in the Yimeng Mountain Area, China.

Our aims were (1) to establish an MDS with a proper indicator for soil quality assessment through physical, chemical and biological characteristics; (2) to compare and analyse the applicable between linear and non-linear scoring method for soil quality assessment; and (3) to evaluate the soil quality of different communities using the SQI method and determine the affecting indicators. The results of this study can provide a more appropriate data reference, which is essential to realize the effective use of forest soil and the stability of the ecosystem.

Materials and methods

Study site

The sites were in the state-owned Dawa Forest Farm of the Yimeng Mountain area of Shandong province, China (latitude 35° 30′ 2.04″ to 35° 30′ 24.43″ N and longitude 117° 55′ 44.14″ to 117° 56′ 3.36″). The mean annual temperature of this region is 12.8 ℃, the mean annual precipitation is 600 mm. The study site has a warm temperate continental monsoon climate, and a frost-free period 191 d. The sediment is dominated by gneiss, and brown forest soil, the average thickness of the arbor soil layer is approximately 20 cm.

Five typical communities with consistent slope orientation, slope position, altitude, and growth were selected as study plots in October 2019: herb, shrub, Quercus acutissima, Pinus thunbergii, and Quercus acutissimaPinus thunbergii mixed-plantation plots. The forest plots were 30-year-old plantations, the main species of vegetation in this area are Quercus acutissima, Pinus thunbergii, Vitex negundo, Cymbopogon goeringii, Achnatherum pekinense, Viola collina, Conyza canadensis, and Artemisia stechmanniana.

Experimental design and soil sampling

Three plots of 5 × 5 m were established in the herb community, three plots of 10 × 10 m were established in the shrub community, and three plots of 30 × 30 m were established in each of the Quercus acutissima, Pinus thunbergii, and mixed-plantation communities. The spaces between adjacent plots were at least 10 m. In each plot, soil samples were collected using a soil auger (diameter, 4 cm) from the 0–15 cm soil layer at 15 random points and then mixed into a composite sample as one replicate. Samples were collected from a total of 225 random points across the different sites, with three plots per community as three independent replicates. In total, 15 composite samples were established26. A subsample of each composite sample was immediately placed in an ice box, transported to the laboratory, and then stored at 4 °C for the analysis of extracellular enzyme activities within two weeks. The other subsample was air-dried for physicochemical analysis (Table 1).

Table 1 Methods used in laboratory analyzes for selected indicators, Particle size: sand, silt and clay, extracellular enzyme: β-1,4-glucosidase(BG), β-1,4-N-acetylglucosaminidase (NAG), L-leucine aminopeptidase (LAP) and acid phosphatase (AP).

Soil quality index

Evaluation steps of SQI: (1) Pearson conducts correlation analysis of soil indicators, PCA was used to group the indicators, and the component with eigenvalue ≥ 1 was selected. The indicators with loadings ≥ 0.5 in the same component were classified into one group. If the loadings of one indicator in different components were ≥ 0.5, the indicator was classified into the group where the indicator had the lowest correlations with other indicators, the total data set was built, and the norm value of each group of indicators was calculated. (2) The indicator whose norm value was within the 10% range of the maximum total value of a group was selected for further correlation analysis. If the indicators were significantly correlated, then the indicator with the highest norm value was retained in the minimum data set, and all others were eliminated. The noncorrelated indicators were considered important and retained in the minimum data set in the same group. (3) After determining the minimum data set for the soil quality index, each soil indicator was transformed into unit-less scores ranging from 0.00 to 1.00 using linear and non-linear scoring function methods30,32.

The Norm Eq. (1) was used as follow:

$${N}_{ik}=\sqrt{{\sum }_{j=1}^{k}\left({u}_{ik}^{2}{e}_{k} \right)}$$
(1)

where Nik is the norm value of k PCs with eigenvalues ≥ 1 for variable i, uik is the loading of soil variable i in component k, and ek is the eigenvalue of component k.

The following Non-linear curves were used as sigmoidal type equation Eq. (2) scoring functions:

$${S}_{NL}=\frac{a}{1+{(x/{x}_{0})}^{b}}$$
(2)

where a is the maximum value (defined as a = 1 in this study) reached by the function, x is the value of the selected indicator and x0 is the mean value of each indicator corresponding to the soils. b is the slope of the equation and was set as − 2.5 for “more is better” and + 2.5 for “less is better” functions.

The following linear curves were subsequently used as “more is better” Eq. (3) or “less is better” (Eq. (4)) scoring functions:

$${S}_{L}=\frac{{y}_{i}-{y}_{mi n}}{{y}_{max}-{y}_{min}}$$
(3)
$${S}_{L}=1-\frac{{y}_{i-}-{y}_{min}}{{y}_{max}-{y}_{min}}$$
(4)

where yi is the measured index value, ymin is a soil variable of the minimum value, and ymax is a soil variable of the maximum value

$$\mathrm{SQI}=\sum_{i=1}^{n}{w}_{i}{s}_{i}$$
(5)

where SQI is the comprehensive soil quality score index, Wi is the weight of the ith evaluation indicator, si is the index score, and n is the number of evaluation indicators.

Statistical analysis

Vector length, representing microbial C limitation, was calculated as the square root of the sum of (lnBG/ln(NAG + LAP))2 and (lnBG/lnAP)2 (Eq. (6)). The vector angle, representing microbial N or P limitation, was calculated as the arctangent of the line extending from the plot origin to point (lnBG/lnAP, lnBG/ln(NAG + LAP) Eq. (7))31. The equations are as follows:

$$\mathrm{Vector length}=\sqrt{{\left(lnBG/\mathrm{ln}[NAG+LAP]\right)}^{2}+{\left(lnBG/lnAP\right)}^{2}}$$
(6)
$$\mathrm{Vector angle}=Degrees\left(ATAN2\left(lnBG/lnAP\right),\left(lnBG/ln\left[NAG+LAP\right]\right)\right)$$
(7)

The mean and standard deviation were calculated using SPSS Statistics 22 software. One-way analysis of variance and Duncan analysis were used to compare the differences between communities. The significance test was carried out at alpha = 0.05. Excel was used to process the data. Pearson correlation was used to analyse the correlation between soil index variables. Principal component analysis (PCA) was used to simplify the data analysis, reduce the dimensionality of complex data, and solve the problem of multicollinearity among explanatory variables. Linear regression analysis determined the relationship between the minimum data set and the total data set. All bar graphs were drawn using Origin 2018.

Results

Establishment of the minimum data set

As a result, the indicators were determined with eigenvalues ≥ 1 in four PCs, which accounted for over 90.883% of the variation in the soil characteristics. This agreed with the requirements of information extraction (Table 2). The 30.255% variation was due to the first PC1, and more than 20% of the variation was due to PC2 and PC3, while PC4 accounted for 18.959% of the variation. The weight values from the TDS indicators were relatively low and similar. In general, indicators with absolute factor loading values ≥ 0.50 were considered highly weighted PCA indicators, which could be first selected from each PC, and divided into 5 groups according to the PC results. Pearson correlation analysis was used to check the correlation between these indicators to reduce redundancy (Fig. 1). The minimum data set (MDS) selected indicators whose Norm value was within 10% of the highest value of the group and eliminates them with strong correlation. Finally, the MDS related indicators were determined as silt, total N, total P, LAP, AP, and vector length.

Table 2 Principal component analysis results and standard values of 16 soil properties and soil quality indicator weights in the minimum data set.
Figure 1
figure 1

Correlation heatmap between the soil physicochemical properties, soil enzyme activities, and soil microbial nutrient limitation (P < 0.05). The value of the correlation coefficient is displayed in the circle in the figure, red represents a positive correlation, blue represents a negative correlation, the shade of the colour represents the strength of the correlation, and the × in the figure represents no significant correlation.

Scores and weights of the MDS indicators

There were significant differences (P < 0.05) between linear and non-linear scoring of the minimum data set indicators for different communities (Fig. 2). The score values of the silt indicator ranged from 0.23 to 0.83 and 0.30 to 0.68 for the linear and non-linear models, respectively. The linear and non-linear total N score values followed the order shrub > Pinus forest > herb > Quercus forest > mixed plantation, and there was no significant difference between the Quercus forest and mixed plantation. The score values of total the P indicator ranged from 0.09 to 0.84 and 0.36 to 0.68 for the linear and non-linear models, respectively. The score values of the total LAP indicator ranged from 0.02 to 0.95 and 0.22 to 0.79 for the linear and non-linear models, respectively. The linear and non-linear AP score values followed the order herb > shrub > Quercus forest > Pinus forest > mixed plantation, while the linear and non-linear vector length scores followed the order mixed plantation > shrub > Pinus forest > Quercus forest > shrub > herb. Generally, the linear scores of the MDS indicators were consistent with the non-linear scored ranking results of the five communities.

Figure 2
figure 2

Linear and non-linear score values of the MDS; values are the means ± standard error of the mean. The different letters indicate significant differences between treatments according to the Tukey test; lowercase letters (such as a, b, c, d) indicate significant differences (P < 0.05) among the five communities. (AC): soil physicochemical properties score; (D,E): soil enzyme activity score; (F): soil microbial nutrient limitation.

In the non-linear model, total N, total P, LAP, and AP were the more the better indicators; silt and vector length were the less the better indicators (Table 3). In the linear function, total N, total P, LAP, and AP were calculated by Eq. (3) to calculate the soil quality index, and silt and vector length were calculated by Eq. (4) to calculate the soil quality index. Vector length had the highest weighting that resulted in the highest contribution to the soil quality index, and total N had the lowest weighting.

Table 3 Type of scoring curves, the parameters of non-linear and linear equations, and calculated weights for the minimum data set.

Accuracy verification of the MDS

The rationality verification of the MDS evaluation index system was an important part of SQI evaluation in the Yimeng Mountain Area, China (Fig. 3). The TDS and MDS of linear and non-linear exhibited a significant positive correlation (P < 0.001), the correlation coefficients determined using the non-linear method (R2 = 0.675) were higher and more accurate than those determined using the linear method (R2 = 0.628), indicating that the results of the two methods were in good agreement. Overall, the MDS method, like the TDS method, can be used as a feasible way of assessing soil quality.

Figure 3
figure 3

Linear regression analysis between MDS and TDS for linear and non-linear models. Linear-TDS-SQI: Linear soil quality index of the total data set, Linear-MDS-SQI: Linear soil quality index of the minimum data set, Non-linear-TDS-SQI: Non-linear soil quality index of the total data set, Non-linear-MDS-SQI: Non-linear soil quality index of the minimum data set. (A): The fitting curve of linear MDS and TDS, (B): the fitting curve of non-linear MDS and TDS.

Soil quality index and analysis of contribution rate

The soil quality index (SQI) of the Pinus forest was higher than that of the other communities, and the mixed forest was similar to the herb in the linear and non-linear models (Fig. 4). The SQI values derived with the linear MDS method ranged from 0.37 to 0.59, the SQI values calculated by the non-linear MDS method ranged from 0.44 to 0.54. The SQI values based on the MDS method of linear and non-linear under different communities showed the same ordering from highest to lowest. The comparative SQI using linear- MDS-SQI Eq. (6) and non-linear- MDS-SQI Eq. (7) methods can be described as follows:

$$\begin{aligned} {\text{Linear - MDS - SQI}} = & \left( {0.{159} \times S_{{L({\text{Silt}})}} } \right) + \left( {0.{146} \times S_{{L({\text{N}})}} } \right) + \left( {0.{159} \times S_{{L({\text{P}})}} } \right) \\ & \quad + \left( {0.{165} \times {\text{S}}S_{{L({\text{LAP}})}} } \right) + \left( {0.{182} \times S_{{L({\text{AP}})}} } \right) + \left( {0.{189} \times {\text{S}}_{{L({\text{Vector length}})}} } \right) \\ \end{aligned}$$
(6)
$$\begin{aligned} & {\text{Non - linear - MDS - SQI}} \\ &\quad = 0.{159} \times \left( {{1}/\left( {{1} + S_{{NL({\text{Silt}})}} /{31}.{956}} \right)^{{{2}.{5}}} } \right) + 0.{146} \times \left( {{1}/\left( {{1} + S_{{NL({\text{N}})}} /0.{21}0} \right)^{{ - {2}.{5}}} } \right) \\ & \qquad + 0.{159} \times \left( {{1}/\left( {{1} + S_{{NL({\text{P}})}} /{1}.0{77}} \right)^{{ - {2}.{5}}} } \right) + 0.{165} \times \left( {{1}/\left( {{1} + S_{{NL({\text{LAP}})}} /{8}.0{89}} \right)^{{ - {2}.{5}}} } \right)\\ & \qquad + 0.{182} \times \left( {{1}/\left( {{1} + S_{{NL({\text{AP}})}} /{93}.0{71}} \right)^{{{2}.{5}}} } \right) + 0.{189} \times \left( {{1}/\left( {{1} + S_{{NL({\text{Vector length}})}} /{1}.{5}0{5}} \right)^{{{2}.{5}}} } \right) \\ \end{aligned}$$
(7)
Figure 4
figure 4

Characteristics of SQI under Yimeng Mountain area, SQI: Soil quality index. Linear-TDS-SQI: Linear soil quality index of the minimum data set (A), Non-linear-MDS-SQI: Non-linear soil quality index of the minimum data set (B).

The specific contribution of each linear MDS indicator to the SQI (Table 3) showed that AP had the highest contribution to the SOI of herbs. In contrast, the contribution was lower in the mixed plantation than in the other communities. Soil total N and P contributions were lower in the Quercus forest than in other communities. The specific contribution of each non-linear MDS indicator towards the SQI (Table 4) showed that six indicators contributed like to the SQI, and LAP had the lowest contribution towards the SQI of herbs. The soil total N and P contributions were lower in the Quercus forest, similar to the linear trend. N had the lowest contribution to the SOI of the mixed plantation.

Table 4 Relative contributions of selected soil indicators to SQI under different communities.

Limiting soil indicators for soil quality

Scores of all soil parameters were plotted in a radar diagram to explore the limiting soil indicators for the linear-SQI and non-linear-SQI (Fig. 5). When lines under different communities crossed the axes, scores of each indicator were projected on the web. Crossing points located on the edge of the web indicated better soil quality, and crossing points near the centre of the web represented worse soil quality. According to the comprehensive analysis of the radar chart of the scores of the two evaluation indicators, silt, available P, LAP, NAG, vector length and vector angle were the major factors influencing herbs; available P, P, NAG, vector length and vector angle were the major factors influencing shrubs; available P, LAP, AP were the main influencing factors of Pinus forest; soil moisture, available P, C and N were the major factors influencing Quercus forest; and soil moisture, pH, organic C, C, N, BG, LAP and AP were the factors influencing the mixed plantation. The results indicated that combined with the proportion of each element in the minimum data set, soil quality was relatively good in the Pinus forest, and the performance results of the influencing factors were similar in linear and non-linear models. The mixed plantation was affected by soil texture, moisture and nutrients, but organic C and C influence were important reasons for the lower soil quality compared to other communities.

Figure 5
figure 5

The limiting factors of soil quality index for SQI. (A): limiting factors of linear soil quality index for SQI, (B): limiting factors of non-linear soil quality index for SQI.

Discussion

Applicability of MDS indicators

The MDS for screening of the soil quality index indicators can reduce time and economic costs, and it is also the most widely used model for soil quality evaluation33,34. PCA combined with the norm value for MDS screening, effectively prevents the lack of important indicators in principal component screening35. In this study, 16 primary indicators were selected, 6 indicators of silt, total P, total N, LAP, AP and vector length were screened by MDS, and 62.5% of the indicators were screened and filtered, simplifying the evaluation of the soil quality index. The screening results included three aspects of physics, chemistry, and biology (Table S1), the indicators were more representative. The significant positive correlation of the linear model MDS and TDS (R2 = 0.628, P < 0.001), and significant positive correlation of the non-linear MDS and TDS (R2 = 0.675, P < 0.001), the correlation index can be used as the minimum data set determination index and the MDS method with similar accuracy to TDS method13,24. Thus, MDS reduces the number of indicators, and can be an effective replacement for the TDS method to evaluated soil quality of arid and barren mountainous.

The predecessors used the MDS to screen the index mostly including soil moisture, pH, silt, total N, total P, organic C, and available P13,14,36. In our study, silt, total N, and total P were retained in the MDS and were consistent with the results of previous studies36. However, pH, soil moisture, organic C, and available P were more researched and not selected, the selection criterion of the smallest data set was within 10% of the largest data in the group, and there was no correlation37. The pH, soil moisture and organic C were the same as the selected total N in the first group, and the norm values of soil moisture (2.001), pH (1.664) and organic C (2.014) compared with the maximum N (2.154) were 7.10%, 22.74% and 6.50%, respectively (Table 1). Soil moisture and organic C were correlated with N, which needed to be eliminated. The available P was the same as the selected LAP in the second group. The available P norm value (1.775) was within 10% of LAP (1.838) in this group, while was correlated with LAP, AP (1.803) was within 10% of LAP and not correlated, AP and LAP were selected as the MDS indicators. The vector length of enzymes in this article could be used to indicate the nutrient metabolism of microorganisms38, and represent the biological properties of the soil. The norm value of vector length was highest in the fourth group, therefore, we confirmed that vector length can be one of the MDS indicators in forest regions.

Soil quality assessment and influencing factors

Research has shown that the applicability of linear and non-linear index evaluation is related to location and index complexity. Yuan et al.23 evaluated soil quality in rice-crayfish farming in the Jianghan Plain using a linear method, which was simple to use and almost does not require prior knowledge of the system; only the threshold of each indicator needs to be understood, and the score of each indicator was determined by observation39,40. Qiu et al.41 evaluated soil quality in Larix principis-rupprechtii plantations in North China using a non-linear method, this method requires a full understanding of the characteristics of each indicator, and the operation is complicated42. While Andrews et al.34 believed that the non-linear index evaluation was more suitable for soil quality evaluation, the difference between treatments was smaller than linear. In this study, the MDS of these two assessment methods were significantly correlated with the TDS (P < 0.001, Fig. 3), and the SQI obtained of five communities by non-linear scoring model was in line with that obtained by linear, but the details were slightly different (Fig. 4), the F-values of SQI obtained using the non-linear scoring method (F = 14.498) in the ANOVA were all greater than the linear scoring method (F = 14.267). The result indicated that non-linear evaluation can balance the proportion of each index evaluation compared with linear evaluation13, thus, the non-linear method could better distinguish the soil quality of the five communities of arid and barren mountainous 43, the non-linear scoring method is more representative of the system function than the linear scoring function42.

The soil quality index was calculated using the integrated quality index Eq. 44. The results showed that the soil quality was relatively higher in Pinus forest than in other communities in the assessment (Fig. 4), the SQI was lower and similar to that of the herb and mixed plantation. This result was inconsistent in that the soil quality gradually increased from grass to shrub to tree in karst24. In this study, the forest was divided into artificial forest plantations for approximately 30 years. Our previous study indicated that the soil moisture was the lowest in mixed plantations45, which indicates a moderately dry period of soil. Appropriate soil gravel content increases soil ventilation and water permeability, while more sand content and less clay content will accelerate soil erosion46. The sand was highest and soil moisture was the lowest in mixed plantations, which affects soil water availability and increase ecological vulnerability45,47. Another consideration is that according to the Niche Theory48, the diameter at breast height and Quercus of the mixed plantation was lower than that of the Quercus pure forest, and the growth of the Pinus was not as good as that of the Pinus pure forest. The height of Quercus trees was higher than that of Pinus, which results in insufficient light and affects the photosynthesis of Pinus (Table S1). In the mixed plantation, there was intraspecific and interspecific competition between Quercus and Pinus, and understorey shrubs49, which was affected by various factors such as soil moisture soil structure, organic C, vegetation type and microbial nutrient metabolism12,50,51, the poor growth and lower SQI of the mixed plantation.

Radar plots of soil parameter scores can indicate limiting indicators of soil quality24. In this study, the influencing factors on the soil quality of herbs, shrubs, Pinus forests and Quercus forests were soil available P, LAP, NAG, and AP related to N- and P- acquiring enzymes. While the influencing factors in the soil quality of the mixed plantation were organic C, total C and BG, these were different from those of the other communities. The accumulation rate of organic C is different in the soil environments of different communities52,53. The soil organic C (13.60 mg g−1), total C (21.13 mg g−1) and BG (26.21 nmol g−1 h−1) of the mixed plantation were lower than those of other communities45, which are key indicators of soil quality54. The organic C and C of the mixed plantation were lower, resulting in a reduction in the carbon source provided and significantly affecting soil carbon sequestration and nutrient cycling55. This was also an important reason for the lower soil quality of the mixed plantation. Therefore, the soil quality was affected by the soil chemical properties and enzymes of different communities.

Conclusions

The improvement of soil quality in arid and barren mountainous areas plays an important role in enhancing the carbon sequestration capacity of forest communities. To evaluate the soil quality of five communities in arid and barren mountainous in the Yimeng Mountain area, China. Sixteen soil physical, chemical, and biological properties were determined, and significant correlations were shown between them. The MDS of six indicators (silt, total N, total P, LAP, AP. vector length) was established in this research for soil assessment. The SQI results showed that Pinus. forest was higher than those of the other communities, and the soil quality index of mixed plantation was lower and similar to the herb community. Soil chemical properties and enzymes influence the soil quality of different communities. Unlike other communities, the soil quality in the mixed plantation was affected by total C and organic C. Hence, in the subsequent production and management of forest stands, we should focus on maintaining the stability of the ecosystem, and strengthening the transformation of the mixed plantation.