Introduction

Modeling of water flow and chemical movement in unsaturated soils has been emphasized by soil scientists and hydrologists for different purposes, such as evaluations of root water uptake, soil erosion, and groundwater pollution risk. However, high variability and complexity of soil texture in natural field make direct measurements of soil hydraulic properties costly and time-consuming. It is desirable to utilize readily available information, such as soil texture and bulk density, to estimate soil hydraulic properties1,2,3. This kind of approach benefits the development of computationally efficient methods for evaluating soil hydraulic heterogeneity in watershed or agricultural field while ensuring the economic feasibility of field investigation efforts within acceptable accuracy. To date, many modeling efforts have been made to relate soil texture (expressed as particle size distribution), soil structural properties, bulk density, and/or organic matter content to soil water retention4,5,6,7. Soil water retention was estimated using multiple regression, neural network analyses, and other methods8,9,10,11,12,13,14. However, the applicability and accuracy of the models are more or less unsatisfactory. Several prediction models were derived on global soil hydraulic datasets, such as applying the Miller-Miller scaling approach to the soil dataset of SoilGrids1km to provide a global consistent soil hydraulic parameterization15, but some of them possess a high correlation to particular soil types and thereby may not be suitable for other soils16,17,18.

An important advancement in using soil particle size distribution to derive a soil water retention characteristic was the development of a physical empirical model by Arya and Paris19,20, Later, Haverkamp and Parlange21 proposed a similar model by combining physical hypotheses with empirical representations and tested the model on sandy soil. Tyler and Wheatcraft22 interpreted the empirical scaling parameter α in the Arya and Paris model as being equivalent to the fractal dimension of a tortuous fractal pore system. However, Arya et al.20 argued that the fractal scaling was limited in estimating water retention characteristics in the complex soil matrix. In the optimized model of Arya et al.20, three methods were proposed for calculating the scaling parameter α, but the calculation still involved empirical component to some extent, making the model sometimes relatively difficult for broad application. The physical basis of the model of Arya and Paris19 or Arya et al.20 is weakened by the assumption that the void ratio of bulk sample is equivalent to the void ratio of individual particle size class.

To improve the mathematical description of physical relations between soil particles and soil pores, we assume that different fractions of soil particles may make different contributions to the total porosity or volumetric water content in the bulk soils and that soil pore volume and associated bulk density are specific for particle size fractions. This line of thinking might help derive a better physical model for mathematical estimation of soil water characteristics. Therefore, the objective of this work was to apply a fractional bulk density (FBD) concept to the development of a soil water retention model that is effective for all soil textures and a wide range of soil bulk density.

Results

Estimation accuracy

Model estimation of water retention characteristics for some soils is presented in Fig. 1. The results indicate that the new procedure was in good agreement with the measured data for most of the soil textures except for sand in the range of water pressure head from 15 cm to 15,000 cm, which covers the entire range of available water content. Table 1 shows comparisons of the coefficient of determination (R2), root mean square error (RMSE), and t value of Student’s t distribution between the FBD model and the curve fitting using the Campbell model23, which was extended from the similar media concept24. The Campbell model is expressed as

$${\theta} = \left( {\frac{{\psi}_{e}}{\psi}} \right)^{\frac{1}{q}} {\theta}_{s}$$
(1)
Figure 1
figure 1

Water retention characteristics measured (circle) and estimated (line) using the fractional bulk density (FBD) model for eight different soil textures.

Table 1 Statistical comparison of soil water contents estimated by the fractional bulk density (FBD) model and fitted by the Campbell model23.

where ψe is air-entry water potential, θs is saturated volumetric water content, and q can be obtained using

$${q} = \left[ {\sum {M}_{i} \left(\ln {D}_{gi}\right)^{2} - \left(\sum {M}_{{{i}}} \ln {D}_{gi} \right)^{2} } \right]^{0.5}$$
(2)

In the equation, Dgi is the diameter of the ith particle-size fractions, and Mi is the cumulative mass percentage of the ≤ Dgi particles.

RMSE values were computed from soil water contents measured and estimated as described in the section of methods. Table 1 shows that the mean value of RMSE of the FBD model was 0.032 cm3/cm3 while that of the Campbell model was 0.024 cm3/cm3. This result was acceptable because the Campbell model used the measured data to fit ψe. The R2 values also supported the acceptability of the FBD model compared to the Campbell model. According to the t values, the FBD model results had no significant difference and systematic bias from the measurements for 25 out of the 30 soils. Figure 2 shows an overall comparison between the water contents measured and estimated by the FBD model for the 30 soils. The values coalesced to the 1:1 line with the RMSE being 0.041 cm3/cm3. This RMSE value was larger than the average in Table 1. The discrepancy was due that different methods were used for averaging the RMSE values for individual soils and all soils. Mayr and Jarvis25 presented pedotransfer functions to estimate soil water retention parameters of the Brooks–Corey model. The resulting mean RMSE value was 0.043 cm3/cm3 for the dependent dataset and 0.048 cm3/cm3 for the independent dataset. Tomasella et al.26 derived a pedotransfer function to predict the water retention parameters of the van Genuchten equation. The mean RMSE values ranged from 0.038 cm3/cm3 to 0.058 cm3/cm3. Our model compared favorably with these pedotransfer functions in terms of mean RMSE values. It could thus be concluded that the FBD model behaved overall well, except for Acolian sandy soil (Soil #01). For sandy soil, the relatively poor capture of the rapid change of water content was attributed to the limitation of applicability of capillary law (i.e., Young–Laplace equation) to sandy media and existence of macropores that might reduce the pore continuity8. The continuity of soil pores was the dominant factor that affected the performance of our proposed model.

Figure 2
figure 2

Comparison of measured and estimated volumetric water content using the fractional bulk density (FBD) model for 30 soils with ranges of soil texture from clay to sand and bulk density from 0.33 to 1.65 g/cm3. The circle represents measured values, and the line denotes a 1:1 agreement.

The FBD model also had relatively larger estimation errors for soils originated from ash parent materials (e.g., Soil #05, 14, 16, and 22) than for other soils (Table 1). This was due likely to the oversimplification of soil particle size distribution as a sigmoid curve, whereas the particle arrangement of soils developed from ash parent materials was actually very complex (i.e., non-sigmoid). The less accurate prediction for sandy soils relative to the other soil textures suggested that the sigmoid-shape assumption of particle size distribution might be arbitrary, despite it was well applied to the particle systems of other soil textures. We infer that the sigmoid-type distribution was more applicable to the soils with a broader range of particle sizes, which demonstrated a lognormal distribution of particle fractions27,28. Soil aggregates with hierarchical pore structure have dual-porosity system. Dual-porosity assumes that porous medium consists of two interacting regions, one associated with the macropore or fracture system and the other comprising micropores inside soil material. Bimodal pore size distributions are frequently observed in dual-porosity soil29. The water retention estimated with the FBD model for a wide range of water pressure head (15–15,000 cm) should thus be a sum of the effects of macropores and micropores30. The sigmoid-type distribution should be more suitable for hierarchical soil aggregates than for less structured soils, such as sandy soil whose pore system was simply dominated by primary particles. Therefore, the FBD model might not perform very well against the soils if their particle sizes have a narrow range.

Sensitivity analysis of model parameters

We performed a sensitivity analysis to identify input parameters that most strongly affected the model behavior and to determine the required precision of the key parameters. The parameters included in the sensitivity test were saturated water content (θs), residual water content (θr), rate coefficient (λ) of Logistic-type model for particle size distribution, and particle size distribution index (ε). The value of each parameter was assumed to increase or decrease by 20% of its actual value since its measurement error could be up to 20% according to our experience in field survey. By taking Soil #22 as an example, the test was implemented to monitor the change in the estimated soil water content caused by changing the value of one parameter at a time while others remained constant. The sensitivity analysis not only showed the influence patterns of the parameters on the model behavior but also ranked the parameters in terms of the magnitude of influences. Figure 3 shows that θs and θr had similarly large impacts on the model estimation. In comparison, λ and ε played less roles in defining the model performance, but their accuracy was still very important for the estimation accuracy. The sensitivity analysis provided insights into the behavior of the FBD model (Eq. 20) and supported the notion that parameter values may have physical meanings no matter in whatever ways the related parameters are structured into a model.

Figure 3
figure 3

A sensitivity analysis on the parameters of the fractional bulk density (FBD) model (Eq. 20). The analysis was based on a sandy clay soil (Andisols, Soil #22 in Table 2). θs, θr, ε, and λ refer to volumetric saturated water content, volumetric residual water content at a pressure head of 15,000 cm water, particle size distribution index, and rate coefficient in Eq. (15) for particle size distribution, respectively.

Discussion

Particle size distribution forms a common descriptor of natural soils. It has been used routinely as one of the inputs to estimate some of soil physical properties, for example, water retention characteristic31,32,33, bulk density34, and hydraulic conductivity35,36,37. In this study, two parameters, rate coefficient (λ in Eq. 15) of the Logistic-type model for particle size distribution and particle size distribution index (ε in Eq. 13), were employed to translate particle size distribution to soil water retention characteristic. However, two parameterization issues should be mentioned for broadening model applicability. One is the estimation of λ in the case that the upper size limit of the particle size distribution is 1,000 μm for some soils while it is 2,000 μm for other soils. In order to perform a consistent comparison among all soils, the particle size distribution with the upper limit of 2,000 μm was normalized to that with the upper limit of 1,000 μm using a normalization formula,

$${{M}}^{\prime}_{{{i}}} = \frac{{{{100M}}_{{{i}}} }}{{{{M}}_{{1,000}} }}$$
(3)

where Mi and Mi are measured and normalized percentage content of particles with sizes smaller than or equal to the ith particle size, respectively. M1,000 denotes the mass percentage of particles with a diameter smaller than or equal to 1,000 μm. The other issue is pertinent to the calculation of ε. It involved three particle sizes (D10, D40, and D60) below which the mass percentage of particles is 10%, 40%, and 60%, respectively. It is easy to identify D60 but sometimes relatively difficult to find D10 and D40. In some soils, the mass of particles with sizes smaller than or equal to the measured lower limit size (e.g., 1 μm or 2 μm) was larger than 10%. In this case, an exponential equation, which was obtained by fitting the relation between the cumulative mass percentage and the corresponding particle sizes, was used to extrapolate for estimating D10. To minimize the deviations arising from the extrapolation, we used 50 μm as the upper size limit of the particle size distribution.

There is no doubt that particle assembling and resulting pore characteristics play important roles in regulating physical, chemical, and biological functions of soils at various scales. The FBD model was generally based on the assumption that the sizes of soil particles and the density of their packing are the primary determinants of the pore size and pore volume. This, however, may not be the case under some conditions. Aggregation of primary particles into secondary and tertiary particles, root channels, and microcracks would account for a fraction of the pore volume with pore sizes not determined by the size distribution of primary particles. The abundance of such pores considerably determines the extent of deviation of prediction. Therefore, it is important to incorporate information of soil structure into soil hydraulic modeling if possible38. Soil structure is a non-negligible factor for accurate estimation of soil hydraulic properties using pedotransfer functions39,40. But this work is difficult to initiate because soil structure information (e.g., soil aggregate size distribution) is mostly unavailable compared to soil basic properties (e.g., particle size distribution, organic matter content, and bulk density). Insufficiency of identification of soil structure indices precludes the inclusion of soil structure characteristics into soil water retention modeling. Relevant efforts have been made in some large-scale models that consider soil structure. For instance, Fatichi et al.41 proposed to assess the impact of soil structure on global climate using an Ocean-Land–Atmosphere Model (OLAM). Although the model in this study does not explicitly include a structural component, in the FDB model we assume that soil bulk density could indirectly bring the influence of soil structure into the estimation of soil water retention.

Soil water retention characteristics were estimated using the FBD model from particle size distribution, bulk density, and measured residual water content. The starting point was the similarity of curve shapes between cumulative particle size distribution and soil water retention characteristics. Similarly, Arya and Paris19 and Haverkamp and Parlange21 used a simple equation to derive a set of soil water content according to the mass fraction of soil particles, and then a series of expressions were employed to regulate soil water pressure head to pair with measured soil water content. The FBD model adopted an opposite approach. A set of water pressure head from 15 cm to 15,000 cm were derived using a simple expression as Eq. (19), and then soil water contents were estimated with Eq. (8) to match the derived water pressure head. Eventually, an analytical model (Eq. 20) was obtained. In the FBD model, the water retention function included a residual water content in relation to the maximum water pressure head (15,000 cm) and the parameter (b) of soil pore size distribution. Similarly, the residual water content was considered in the van Genuchten model42 or Brooks and Corey model43. However, Campbell23 described soil water retention curve by assuming there was no residual water content. An advantage of the Campbell equation is its excellent fitting capability. Thus, we evaluated the performance of the FBD model by comparing it to the Campbell model in this study.

The selection of a Logistic-type equation for the model formulation was mainly due to the consideration that particle size distribution and pore size distribution in most soils were approximately lognormal27,44,45,46. The logistic growth equation generated a curve that tended towards an exponential form at low values and a power form at high values, with a power index smaller than 1. This characteristic implicitly included the consideration that the drainage of water in small pores at large suction was usually expected to be more impaired than the release of water from large pores at small suctions47,48.

Conclusions

An analytical model, which is based on a fractional bulk density concept, was presented for estimating soil water retention for the entire range of water pressure head that determines water availability. The proposed model was tested using 30 sets of soil water retention data measured for various textures of soils that had a wide range of soil bulk density from 0.33 g/cm3 to 1.65 g/cm3. Results showed that the proposed model could convert readily available soil physical properties into soil retention curves in very good agreement with the measurements, and the model was applicable to soils with limited data of soil particle size distribution at small loss of estimation accuracy in the middle portion of water retention curves of sandy soils. Sensitivity analyses revealed that saturated and residual water contents were two parameters of high sensitivity for accurate estimation of the water retention curves. The agreement between the estimated and measured results supported the concept underlying the FBD model. The modeling followed a process of conceptual partitioning of pore space according to the relative contribution of certain sizes of particles to the change in pore space. In addition, the model assumed a sigmoid curve of water retention characteristic for most soils. However, these assumptions need further verification by considering the physical reality of soils and potential improvements and extensions. Compared to subsurface soils, larger deviations should be expected for surface soil materials where aggregation, cracking, and root effects may be pronounced. Further tests of the model application to other soils (e.g., Vertisols, Aridisols, and salt affected soils) and evaluation of the effects of water hysteresis, soil aggregation, and swelling-shrinkage behaviors might reveal the weaknesses of the FBD model and help identify additional variables needed for model improvement.

Material and methods

Fractional bulk density concept

The first assumption is that soil particles with different sizes contribute to different porosities and water holding capacities in bulk soil. Based on a non-similar media concept (NSMC) defined by Miyazaki49, soil bulk density (ρb) is defined as

$$\rho _{{{b}}} = \frac{{{M}}}{{{V}}} = \tau \rho _{{{s}}} \left( {\frac{{{S}}}{{{{S}} + {{d}}}}} \right)^{3}$$
(4)

where M is the mass of a given soil, V is the volume of bulk soil, ρs is soil particle density, and S and d are characteristic lengths of solid phase and pore space, respectively. The parameter τ is a shape factor of the solid phase, defined as the ratio of the substantial volume of solid phase to the volume S3. The value of τ is 1.0 for a cube and π/6 for a sphere. As pointed out by Miyazaki49, these characteristic lengths are not directly measurable but are representative lengths in the sense of the characteristic length in a similar media concept (SMC). Following the approach of NSMC represented by Eq. (4), we conceptually defined the volume of bulk soil as

$$V = \frac{{\mathop \sum \nolimits_{{{i } = { 1}}}^{{{n}}} {{m}}_{{{i}}} }}{{{\rho }_{{{b}}} }} = \frac{{{{m}}_{{1}} }}{{{\rho }_{{{{b1}}}} }}{ + }\frac{{{{m}}_{{2}} }}{{{\rho }_{{{{b2}}}} }}{ + } \cdot{\mkern -4mu}\cdot{\mkern -4mu}\cdot \frac{{{{m}}_{{{n}}} }}{{{\rho }_{{{{bn}}}} }}$$
(5)

where mi and ρbi are the solid mass and equivalent bulk density of the ith size fraction of soil particles, respectively. In this study, diameters of the first particle fraction and the last one were assumed to be 1 µm and 1000 µm, respectively8. This equation suggests that different particle size fractions are associated with different equivalent bulk densities due to different contributions of particle arrangement to soil pore space. As a result, the particles with the same size fraction could have different equivalent bulk densities in soils with different textures or after the soil particles are rearranged (e.g., compaction). Figure 4 provides a diagrammatic representation of such fractional bulk density concept for the variation of soil pore volume with soil particle assemblage.

Figure 4
figure 4

Diagrammatic representation of the fractional bulk density (FBD) model. V and ρb are the volume of bulk soil and the bulk density of whole soil, respectively. mi, and ρbi refer to the solid mass and equivalent bulk density associated with the ith particle-size fractions, respectively.

Calculation of volumetric water content

For a specific soil, Eq. (5) means

$${{V}}_{{{{pi}}}} \left( { \le {{D}}_{{{i}}} } \right){{ } = { f}}\left( {{{D}}_{{{{gi}}}} {{, M}}_{{{i}}} } \right)$$
(6)

where Vpi(≤ Di) denotes the volume of the pores with diameter ≤ Di generated by soil particles with diametes ≤ Dgi in unit volume of soil. Mi is the cumulative mass percentage of the ≤ Dgi particles. Since the pore volume has the maximum value for a given bulk soil and the cumulative distribution of pore volume could be generally hypothesized as a sigmoid curve for most of the natural soils44,45, we formulated Eq. (6) using a lognormal Logistic equation,

$${{V}}_{{{{pi}}}} \left( { \le {{D}}_{{{{gi}}}} } \right) = \frac{{{{V}}_{{{{pmax}}}} }}{{1 + \kappa \left( {{{D}}_{{{{gi}}}} } \right)^{{{{b}}_{{{i}}} }} }}$$
(7)

where Vpmax is the maximum cumulative volume of pores pertinent to the particles smaller than or equal to the maximum diameter (Dgmax) in unit volume of soil. In fact, here Vpmax is equal to the total porosity (φT) of soil. Vpi (≤ Dgi) is the volume of the pores produced by ≤ Dgi particles in unit volume of soil, and bi is a varying parameter of increase in cumulative pore volume with an increment of Dgi. By assuming a complete saturation of soil pore space, Eq. (7) changes into

$$\theta_{{{i}}} \left( { \le {{D}}_{{{{gi}}}} } \right) = \frac{{\theta_{{{s}}} }}{{1 + \kappa \left( {{{D}}_{{{{gi}}}} } \right)^{{{{b}}_{{{i}}} }} }}$$
(8)

where θs is saturated volumetric water content calculated with

$$\theta_{{{s}}} = \left\{ {\begin{array}{*{20}l} {0.9\varphi _{{{T}}} ,} \hfill & {~\rho _{{{b}}} < 1} \hfill \\ {\varphi _{{{T}}} ,} \hfill & {~~\rho _{{{b}}} \ge 1} \hfill \\ \end{array} } \right.$$
(9)
$$\varphi _{{{T}}} = \frac{{\rho _{{{s}}} - \rho _{{{b}}} }}{{\rho _{{{s}}} }}$$
(10)

In the above equations, ρbis measured soil bulk density, and ρs is soil particle density (2.65 g/cm3). The empirical parameter κ in Eqs. (7) and (8) is defined as

$${{\kappa}} = \frac{{\theta_{{{s}}} - \theta_{{{r}}} }}{{\theta_{{{r}}} }}$$
(11)

where θr is measured residual water content. In this study, θr is set as the volumetric water content at water pressure head of 15,000 cm. The empirical parameter bi is defined as

$${{b}}_{{{i}}} = \frac{{\epsilon }}{{3}}{\log}\left( {\frac{{{\theta }_{{{s}}} {{ - \omega }}_{{{i}}} {\theta }_{{{s}}} }}{{{{\kappa \omega }}_{{{i}}} {\theta }_{{{s}}} }}} \right)$$
(12)

with ε, a particle size distribution index, calculated with

$${\varepsilon }\; = \;\frac{{\left( {{{D}}_{{{40}}} } \right)^{{2}} }}{{{{D}}_{{{10}}} {{D}}_{{{60}}} }}$$
(13)

where D10, D40, and D60 represent the particle diameters below which the cumulative mass percentages of soil particles are 10%, 40%, and 60%, respectively.

The parameter ωi is coefficient for soil particles of the ith size fraction, with a range of value between θr/θs and 1.0. By incorporating soil physical properties, ωi can be estimated with

$${\omega }_{{{i}}} = \frac{{{g}}}{{{{1 + \kappa }}\left( {{{lnD}}_{{{{gi}}}} } \right)^{{\lambda}} }}$$
(14)

where g is regulation coefficient (1.0–1.2). We set it to be 1.2 in this study. λ is the ratio coefficient of particle size distribution fitted using the lognormal Logistic model,

$$M_{i} = \frac{{M_{T} }}{{1 + \eta D_{{gi}} ^{\lambda } }}$$
(15)

where MT represent the total mass percentage of all sizes of soil particles, and η is a fitting parameter. We set MT = 101 in Eq. (15) for best fit of the particle size distribution. In this study, this continuous function was generated from the discrete data pairs of Dgi and Mi at cutting particle diameters of 1,000, 750, 500, 400, 350, 300, 250, 200, 150, 100, 50, 30, 15, 7.5, 5, 3, 2, and 1 μm. Considering the difference in the upper limits of particle sizes associated with existing datasets of Dgi and Mi, the particle size distribution with the upper limit of 2,000 μm for the Acolian sandy soil and volcanic ash soils in Table 2 was normalized to the case with the upper limit of 1,000 μm using Eq. (3).

Table 2 Physical properties of soils used in the study. ρb is bulk density (g/cm3); θr is residual water content (cm3/cm3) at 15,000 cm water pressure head; ε is particle size distribution index.

Calculation of water pressure head

To estimate the capillary tube or pore diameter (Di in µm), which was composed of particles with the size of Dgi (µm), Arya and Paris19 developed an expression

$${{D}}_{{{i}}} {{ } = { D}}_{{{{gi}}}} \left[ {\frac{{2}}{{3}}{{en}}_{{{i}}}^{{{{(1 - \alpha )}}}} } \right]^{{{0}{{.5}}}}$$
(16)

where α is the empirical scaling parameter varying between 1.35 and 1.40 in their original model19, but was thought to vary with soil particle size in the optimized model of Arya et al.20. In Tyler and Wheatcraft's model22 α is the fractal dimension of the pore. The parameter e is the void rate of entire soil and assumed unchanging with particle size. However, according to Eqs. (5) and (6), e in Eq. (16) should vary with particle size and be replaced by ei, which depends on soil particle sizes. ni is the number of particles in the ith size fraction with a particle diameter (Dgi in μm), assuming that the particles are spherical and that the entire pore volume formed by assemblage of the particles in this class is represented by a single cylindrical pore. The equation for calculating ni is given as19

$$n_{i} = \frac{{6m_{i} }}{{\rho_{s} \pi D_{gi}^{3} }} \times 10^{12}$$
(17)

where mi is the mass of particles in the ith size fraction of particles. Assuming that soil water has a zero contact angle and a surface tension of 0.075 N/m at 25 °C, the minimum diameter of soil pore (Dmin) was taken to be 0.2 µm in this study, which is equivalent to the water pressure head of 15,000 cm according to Young–Laplace equation. We set this minimum pore size to correspond the minimum particle size (Dgmin = 1.0 µm). The FBD model might thus not apply well to porous media with pores smaller than 0.2 μm. As a result, Eq. (16) can be simplified into the following equation.

$${{D}}_{{{i}}} { = 0}{{.2D}}_{{{{gi}}}}$$
(18)

The equivalent capillary pressure (ψi in cm) corresponding to the ith particle size fraction can be calculated using

$$\psi_{{{i}}} = \frac{{{3000}}}{{{{D}}_{{{i}}} }} = \frac{{{15000}}}{{{{D}}_{{{{gi}}}} }}$$
(19)

In Eq. (19), the maximum water pressure head (ψr = 15,000 cm) corresponds to θr and Dgmin (1 μm). The minimum water pressure head (ψ0 = 15 cm) corresponds to θs and Dgmax (1,000 μm). These assumptions were arbitrary and might not be appropriate for some soil types. But these values were used in the study because they approximated the practical range of measurements well.

The resulting model of soil water retention

Equations 8 and 19 formulate a FBD-based model for estimation of soil water retention curve. To simplify the computation, we incorporated the two equations into the following analytical form,

$${\theta }\; = \;\frac{{{\theta }_{{{s}}} }}{{{1 + }\left( {\frac{{{\theta }_{{{s}}} - {\theta }_{{{r}}} }}{{{\theta }_{{{r}}} }}} \right)\left( {\frac{{15,000}}{{\psi }}} \right)^{{{b}}} }}$$
(20)

with the parameter b obtained using

$${{b}}\; = \;\frac{{\epsilon }}{{3}}{\log}\left\{ {\frac{{{{(\theta }}_{{{s}}} - {\theta }_{{{r}}} {{)[ln(}}\frac{{{15,000}{{.1}}}}{{\psi }}{)]}^{{\lambda }} - {{(g}} - {{1)\theta }}_{{{r}}} }}{{{{g(\theta }}_{{{s}}} - {\theta }_{{{r}}} {)}}}} \right\}$$
(21)

In Eq. (21), a water pressure head of 15,000.1 cm is employed to consecutively predict the soil water content until the water pressure head of 15,000 cm.

Soil dataset

Evaluation of the applicability of the proposed modeling procedure required datasets that included soil bulk density, residual water content, and soil particle size distribution covering three particle diameters (D10, D40, and D60) below which the cumulative mass fractions of particles were 10%, 40%, and 60%, respectively. In addition, measured water content and water pressure head were required for the actual retention curve in order to compare with the result of the FBD model. In this study, the soil water retention data of 30 different soils, measured by Yu et al.50, Chen and Wang51, Zhang and Miao52, Liu and Amemiya53, Hayano et al.54, and Yabashi et al.55 were used for model verification (Table 2). The data covered soils in China (such as black soil, chernozem soil, cinnamon soil, brown earth, fluvo-aquic soil, albic soil, red earth, humid-thermo ferralitic, purplish soil, meadow soil, and yellow earth) and soils in Japan (such as volcanic ash soil and acolian sandy soil). The USDA soil taxonomy of these soils was provided in Table 2. The 30 soils ranged in texture from clay to sand and in bulk density from 0.33 g/cm3 to 1.65 g/cm3, which covered a much wider range of soil bulk density than many of the existing models or pedotransfer functions56,57,58,59. Particle size fractions (Dgi) were chosen as the upper limit of the diameters between successive sieve sizes. For the data set in which particle density was not determined, 2.65 g/cm3 was used.

Statistical parameters for model verification

Four statistical properties, R2, RMSE, mean residual error (ME), and t value were calculated to determine the accuracy of the FBD model. The R2 values were computed at the same value of ψ, with the values of θ measured and estimated by the FBD model (Eq. 20). RMSE and ME were obtained, respectively, by

$${{RMSE}}\; = \;\left[ {\frac{{1}}{{{n}}}\sum {{(\theta }}_{{{{est}}}} {{ - \theta }}_{{{{mea}}}} {)}^{{2}} } \right]^{{{0}{{.5}}}}$$
(22)
$${{ME}}\; = \;\frac{{1}}{{{n}}}\sum_{{{i } = { 1}}}^{{{n}}} \left( {{\theta }_{{{{est}}}} - {\theta }_{{{{mea}}}} } \right)$$
(23)

where θmea was measured soil water content, θest was soil water content estimated with the FBD model, and n was the number of measured pairs of water content and pressure head. With the assumption of normal distribution and independence of differences between the water contents measured and estimated by the FBD model, t was calculated with

$${{t}}\; = \;{{ME}}\left( {\frac{{{{RMSE}}^{{2}} - {{ME}}^{{2}} }}{{{{n}} - {1}}}} \right)^{{ - {0}{{.5}}}}$$
(24)

when calculated |t| was larger than t0.05 (the critical value of the Student’s t distribution for P = 0.05 and n−1 degrees of freedom), the differences between the measured and estimated water contents were statistically significant. If t < 0, soil water contents were underestimated and vice versa. Thus, t was a measure for the systematic bias in the estimation. Values of t close to zero indicated that the measured and estimated soil water contents were not different systematically from each other or, equivalently, that there was no consistent bias. Values of t that differed greatly from zero indicated the presence of systematic bias. RMSE was a measure for the scatter of the data points around the 1:1 line. Low RMSE values indicated less scatter. Low RMSE values also implied low ME. Regarding the result that t was low while RMSE was high, it could be explained that negative and positive deviations distributed more evenly on the two sides of 1:1 line.