Kriging-based surrogate data-enriching artificial neural network prediction of strength and permeability of permeable cement-stabilized base

Wang, Xiaoming; Xiao, Yuanjie; Li, Wenqi; Wang, Meng; Zhou, Yanbin; Chen, Yuliang; Li, Zhiyong

doi:10.1038/s41467-024-48766-4

Download PDF

Article
Open access
Published: 07 June 2024

Kriging-based surrogate data-enriching artificial neural network prediction of strength and permeability of permeable cement-stabilized base

Xiaoming Wang¹,
Yuanjie Xiao ORCID: orcid.org/0000-0003-4450-9012^1,2,
Wenqi Li¹,
Meng Wang¹,
Yanbin Zhou³,
Yuliang Chen⁴ &
…
Zhiyong Li⁴

Nature Communications volume 15, Article number: 4891 (2024) Cite this article

2636 Accesses
2 Citations
Metrics details

Subjects

Abstract

Limited test data hinder the accurate prediction of mechanical strength and permeability of permeable cement-stabilized base materials (PCBM). Here we show a kriging-based surrogate model assisted artificial neural network (KS-ANN) framework that integrates laboratory testing, mathematical modeling, and machine learning. A statistical distribution model was established from limited test data to enrich the dataset through the combination of markov chain monte carlo simulation and kriging-based surrogate modeling. Subsequently, an artificial neural network (ANN) model was trained using the enriched dataset. The results demonstrate that the well-trained KS-ANN model effectively captures the actual data distribution characteristics. The accurate prediction of the mechanical strength and permeability of PCBM under the constraint of limited data validates the effectiveness of the proposed framework. As compared to traditional ANN models, the KS-ANN model improves the prediction accuracy of PCBM’s mechanical strength by 21%. Based on the accurate prediction of PCBM’s mechanical strength and permeability by the KS-ANN model, an optimization function was developed to determine the optimal cement content and compaction force range of PCBM, enabling it to concurrently satisfy the requirements of mechanical strength and permeability. This study provides a cost-effective and rapid solution for evaluating the performance and optimizing the design of PCBM and similar materials.

Indirect prediction of graphene nanoplatelets-reinforced cementitious composites compressive strength by using machine learning approaches

Article Open access 20 June 2024

Data-driven prediction on critical mechanical properties of engineered cementitious composites based on machine learning

Article Open access 03 July 2024

Estimation of compressive strength of waste concrete utilizing fly ash/slag in concrete with interpretable approaches: optimization and graphical user interface (GUI)

Article Open access 26 February 2024

Introduction

The accelerated progress of artificial intelligence has facilitated the widespread implementation of data-driven machine learning (ML) techniques in material design optimization¹. Charrier & Ouellet-Plamondon² utilized artificial neural networks (ANNs) to evaluate the impact of admixtures on the fresh properties of cement slurries. To optimize the mechanical properties of cement paste, they determined the dosage of each admixture based on the critical yield stress. Other studies have also utilized ANNs^3,4,5. Moreover, ANN models have been utilized to predict the physical properties of concrete materials, including durability⁶, rheology⁷, optimal grading⁸, and flexural strength⁹. ANN models have exhibited strong robustness in previous applied studies and effectively solve highly nonlinear problems. However, ANN models require large amounts of training data¹⁰, making them cost-prohibitive in the civil engineering field, which is primarily based on laboratory experiments. The proxy model is a simplified function approximation of a complex model and can significantly reduce experimental costs, thereby promoting engineering analysis of complex systems¹¹. The kriging-based surrogate (KS) model has exhibited powerful analytical ability in predicting the failure probability^12,13,14,15, identifying damages in civil engineering structures¹⁶, and significantly reducing experimental costs^16,17,18. Therefore, a kriging-based surrogate model needs to be developed to aid ANN models in improving the prediction accuracy, thus reducing experimental costs. Further, the popularity of transfer learning has led to the need for a large number of raw datasets to train valuable weight files of transfer learning^19,20,21,22. However, obtaining such datasets is often challenging in the civil engineering field due to the time-consuming and labor-intensive nature of extensive complex laboratory tests involved. In contrast, a machine learning paradigm based on kriging-based surrogate modeling-aided artificial neural networks can be used to enrich limited datasets. This approach provides valuable references for addressing this challenge and effectively reduces the cost of constructing weighting models of migration learning.

Permeable cement-stabilized base materials (PCBMs) typically have highly interconnected porosity, which allows water to flow smoothly through them under the influence of gravity^23,24. Although these properties are advantageous in terms of fully permeable road base structures, they have a disadvantage, i.e., decreased mechanical strength. In fact, as the porosity increases, the contact area between cement bridges and the adjacent aggregates decreases, and the cement bridges between aggregates exhibit higher effective stress, which leads to failure at lower load levels, resulting in damage such as subgrade failure, clogged permeable pores, and uneven settlement of the subgrade^25,26,27. Therefore, to meet the long-term stable operation requirements of permeable bases, PCBMs with balanced mechanical strength and permeability coefficient are needed. The cement content of PCBMs and the compaction force during construction affect this balance due to changes in the spatial distribution and pore size^28,29. Due to the variations in the application environments and available materials, the strength and coefficient of permeability of PCBMs under different combinations of cement content and compaction force need to be evaluated based on past experiences and extensive trial-and-error tests. This often entails very high costs as well as repetitive and tedious work, which is detrimental to the promotion and improvement of PCBMs.

PCBM is similar to pervious concrete in terms of the fact that it also confronts challenges in balancing mechanical strength and permeability properties during the design phase. Existing studies have focused mainly on revealing the correlation mechanisms between the compressive strength of pervious concrete and its pervious pore structures and on developing predictive models that relate strength properties to pervious pore characteristics. Such work has proven valuable in guiding the optimal design of pervious concrete^{24,30,31,32,33,34,35,36}. However, laboratory testing methods are known to be time-consuming, labor-intensive, and financially costly. Furthermore, when changes occur in actual engineering conditions, extensive laboratory tests are required to re-evaluate the hydromechanical properties of materials, causing significant financial and labor investments. In light of this deficiency, a variety of alternative methods have been employed to evaluate the mechanical strength and infiltration properties of pervious concrete, including the random lattice discrete particle method³⁷, discrete element method (DEM)^38,39,40, finite element method (FEM)^41,42, computational fluid dynamics (CFD) method⁴³, and other self-developed numerical simulation methods^44,45,46. The aim of these methods is to reduce the expenses associated with laboratory tests and expedite the material design process. The balance between the mechanical strength and permeability properties of pervious concrete is a crucial area of study. By utilizing data-driven machine learning-based methods, it becomes possible to efficiently evaluate these properties and minimize testing costs while enhancing design efficiency. Currently, machine learning models are primarily trained by extensive laboratory test data to expedite the development of high-performance pervious concrete designs^47,48. However, the development of machine learning models requires substantial datasets, leading to much greater data demands and economic costs. To overcome these challenges, there is a need to create a cost-effective machine-learning model that can streamline experimentation and expedite the design of materials.

As compared to pervious concrete, PCBMs involve more intricate preparation processes and internal structures. Pervious concrete generally contains 20%–40% cement^49,50 and is self-compacted or compacted by lower compaction forces^46,47,51. Due to its relatively higher cement content, pervious concrete usually exhibits greater mechanical strength. Conversely, PCBM typically contains 3% to 15% cement⁵² and is compacted by forces up to 300 kN. A reduced cement content leads to decreased mechanical strength, while increased compaction force may result in inadequate permeability of the PCBM. Xiao et al.⁵³ established a preliminary linkage between field and laboratory material compaction by directly applying field compaction force levels in laboratory specimen compaction⁵³. However, there is still a lack of in-depth research in this area, especially for permeable cement-stabilized base materials. Compared to pervious concrete, PCBMs exhibit a much more intricate balance between mechanical strength and permeability properties. The prediction of PCBM properties, such as mechanical strength, porosity, and coefficient of permeability, from laboratory design parameters (e.g., cement content and compaction force) is highly indispensable for optimizing PCBM design. However, few studies have focused on this specific aspect.

In this study, a kriging-based surrogate data-enriching artificial neural network (KS-ANN) model is developed to achieve high-precision prediction results with a small amount of data and provide heuristic theoretical references for the design optimization and compaction scheme determination of PCBMs. To achieve this main objective, a series of subsidiary studies are conducted. This includes (i) establishing a reliable laboratory test plan to obtain authentic test samples and provide reliable data input for the kriging-based surrogate (KS) models, (ii) using KS models to aid ANN models in effectively predicting the uniaxial compressive strength and coefficient of permeability of PCBMs, and (iii) applying the proposed method to optimize the PCBM design and compaction schemes. The laboratory test of PCBMs is chosen as a prime example. The suggested methodology is integrated into the decision-making process to enhance the design and on-site compaction plan of PCBMs. The suggested approach has the potential to significantly decrease the quantity of laboratory testing specimens as well as testing expenses. Additionally, a connection is established among laboratory testing, mathematical models, and ML, providing useful theoretical insights for projecting the functionality, refining the design, and engineering applications of analogous materials. It’s worth noting that the novelty of the study lies in the application of data enrichment techniques combined with machine learning models for the specific PCBMs, rather than purely developing a new technique that can be applied to various types of data problems.

Results

Experimental design and results

For PCBMs, the materials are required to have a 28-day unconfined compressive strength of at least 3.5 MPa and a coeffcient of permeability of at least 0.5 ${{{{{\rm{mm}}}}}}\cdot {{{{{{\rm{s}}}}}}}^{-1}$⁵⁴. Laboratory uniaxial compression tests are typically performed to directly examine whether the unconfined compressive strength (${{{{{\rm{\sigma }}}}}}$) of a material meets the usage requirements. However, in the on-site compaction construction procedure, the PCBM is first thoroughly mixed and evenly spread in the subgrade. Then, static pre-compaction is performed using a 10-ton (approximately 100 kN) steel wheel roller. Subsequently, a second static compaction of the PCBM is performed using a 20-ton (approximately 200 kN) steel wheel roller, resulting in a cement-stabilized base in a well-compacted state. Finally, to achieve the required density, a third static compaction is performed on the cement-stabilized base using a 30-ton (approximately 300 kN) rubber wheel roller. During the three rounds of compaction operations, to ensure that the cement paste adhering to the aggregate surfaces does not fall off, the roller compactor always maintains static compaction mode, which eliminates the vibration compaction process. In laboratory experiments, to reproduce the compacted state of PCBMs, the static pressure method is usually used for sample preparation. To investigate the influences of compaction force and cement content on the strength and coefficient of permeability of PCBMs, this study designed three different levels of static compaction force (i.e., 100 kN, 150 kN, and 200 kN) and three different levels of cement content (i.e., 5%, 10%, and 20%). Therefore, nine different working conditions corresponding to different combinations of cement content and compaction force were specified. The combinations of these parameters yielded the tested unconfined compressive strength (${{{{{\rm{\sigma }}}}}}$), porosity (P), and coefficient of permeability (K), which formed a relatively small dataset. The obtained small datasets were used as inputs for the KS-ANN model, which was trained to predict the strength and coefficient of permeability of PCBMs under the compaction force of 300 kN. Among them, the orthogonal experimental design is shown in Tables 1 where 3 replicate specimens were tested for each group of experiments. The detailed laboratory testing process and analysis of the PCBM test results are shown in the supplementary information document.

Table 1 Orthogonal design parameters for the laboratory unconfined compression tests

Full size table

In actual engineering applications, to achieve desired permeability, aggregates larger than 4.75 mm are typically used. Figure 1a displays the typical gradation commonly used in actual engineering applications.

**Fig. 1: Design gradation curve of the materials and statistical correlations of the laboratory test results.**

The inner diameter and height of the mold used for the unconfined compressive strength test in the laboratory are 140 and 180 mm, respectively. The designed aggregate quantity is 4.0 kg, and the main rock of the aggregates is limestone with a natural density of approximately 2600 kg/m³. In order to maximize the adhesion of the cement paste to the aggregate surfaces, the aggregates were washed before sample preparation. This is done to prevent impurities, such as dust and soil, from affecting the adhesion rate of the cement paste to the aggregate surfaces. The washed aggregates were then kept in a sealed plastic bag for 24 h before sample preparation. In addition, the adhesion of the cement paste to the aggregate surfaces can be improved by reducing the fluidity of the paste. Thus, the range of aggregate sizes used in this paper is fixed from 4.75–26.5 mm, and the water-cement ratio is fixed at 0.4. The detailed design parameters of the cement paste are shown in Table 2.

Table 2 Design parameters of the cement paste in PCBM

Full size table

After thoroughly mixing and blending the aggregates, cement, and water, they are placed into a mold and compacted using the designed compaction force. After compaction, the sample was de-molded after being left to stand for 24 h under laboratory conditions and then placed in a standard curing room for 28 days. In this study, the porosity (P), coefficient of permeability (K), and unconfined compressive strength (${{{{{\rm{\sigma }}}}}}$) of PCBM specimens were determined to obtain accurate laboratory data. Among them, each combination of conditions included 3 replicate specimens. The unconfined compressive strength (${{{{{\rm{\sigma }}}}}}$), porosity (P), and coefficient of permeability (K) under different combination conditions include a total of 81 data points. The laboratory test results are shown in Table 3.

Table 3 Laboratory test results for the uniaxial compressive strength, porosity, and coefficient of permeability of PCBMs

Full size table

In practical engineering design, PCBMs need to fulfill the requirements of both unconfined compressive strength and permeability. It is not economical to ensure the balance range of the mechanical strength and permeability of PCBMs through a large number of laboratory experiments, and the application of machine learning methods in databases with few samples mainly suffers from low accuracy and underfitting problems. Therefore, we developed a KS-ANN method to seek to improve the prediction accuracy with a small number of samples and to optimize material design and sample preparation methods (or on-site compaction).

Correlation analysis of the unconfined compressive strength, porosity, permeability coefficient, compaction force, and cement content of PCBMs

Figure 1b presents the results of the correlation analysis between different design variables and performance indicators. As shown in the figure, the coefficient of permeability (K) and porosity (P) exhibit negative correlations with the compaction force or cement content. The cement content significantly affects the coefficient of permeability (K), with a correlation coefficient of −0.89. The correlation coefficient between the compaction force and coefficient of permeability (K) is only −0.42. The correlation coefficient between the porosity (P) and cement content is −0.76, while that between the porosity (P) and compaction force is only −0.29. The unconfined compressive strength (σ) of the sample is positively correlated with both the compaction force and cement content, and the unconfined compressive strength (σ) has a significantly greater correlation coefficient with the cement content (0.91) than with the compaction force (0.35). Additionally, the correlation coefficient between the compaction force and cement content is 0, indicating no correlation. However, the values of the unconfined compressive strength (σ), porosity (P), and coefficient of permeability (K) differ under different combinations of compaction force and cement content. Therefore, significant statistical inter-correlations exist among the unconfined compressive strength (σ), porosity (P), coefficient of permeability (K), and sample preparation conditions, which is a prerequisite for using a kriging-based surrogate model.

The above analysis indicates that an excessively high compaction force (corresponding to the on-site compaction force) has less effect on the unconfined compressive strength of PCBMs, whereas a slight increase in cement content can significantly enhance the unconfined compressive strength of PCBMs. However, an excessive compaction force may cause blockage of internal permeable voids. The adverse influence of the cement content on the coefficient of permeability (K) and porosity (P) may render the pervious functionality of PCBM inadequate and, hence, should not be ignored. Therefore, obtaining the optimal ranges of cement content and compaction force (corresponding to the on-site compaction force) through the proposed method is of great theoretical and practical significance for balancing the mechanical strength and coefficient of permeability of PCBMs.

Implementation process and result analysis of the KS-ANN model

To achieve high predictive accuracy for the KS-ANN model, we first constructed statistical distribution models of the experimental data and used the Markov chain-Monte Carlo (MCMC) method to expand the experimental data. Based on this, a kriging-based surrogate (KS) model was executed to obtain results for the unconfined compressive strength (σ), porosity (P), and coefficient of permeability (K) under more combinations of working conditions. This mathematically expanded the experimental database while ensuring data continuity, and the mean and variance values of the expanded database remained consistent with those of the previous database. Finally, the results of the KS model were used to train an ANN model, and the KS-ANN model was compared with the traditional ANN model.

Analysis of Markov Chain Monte Carlo (MCMC) simulation results

The unconfined compressive strength (σ), porosity (P), and coefficient of permeability (K) obtained from laboratory tests under different testing conditions typically exhibit certain degrees of variability. For an ANN model, using test data with variable inputs helps it learn the discrete distribution characteristics of true data, thereby yielding predictive results that are closely aligned with actual conditions. Prior to MCMC simulations, the distribution model, mean, and variance of the data under different operating conditions need to be obtained. However, the data samples under each individual condition are limited, making it difficult to fit reliable statistical distribution models. Therefore, this study assumed that the data distribution under individual conditions follows the data distribution under group conditions. By analyzing the statistical distribution models of all the data, the data of the unconfined compressive strength (σ), porosity (P), and coefficient of permeability (K) were determined to follow a normal distribution. Therefore, a normal distribution model was used to fit the data of individual conditions to obtain their mean and variance values. Figure 6d displays the spatial distribution characteristics of the mean and variance of the unconfined compressive strength of the samples under different conditions (see below). The results serve as input for MCMC simulations for generating a massive amount of virtual data with the same distribution characteristics, which were used to train the KS model. The same processing method was employed for the porosity (P) and coefficient of permeability (K). Furthermore, as suggested by the ensemble learning concept^1,55, multiple random numbers can be used in MCMC to generate multiple initial datasets with the same mathematical distribution but different data. This allows for the training of multiple parallel KS-ANN models. The selection of random numbers can be customized based on the specific data requirements. Additionally, the predictions from these models can be averaged or voted to obtain results that meet the desired criteria. Therefore, in this paper, multiple random numbers were specified in the MCMC simulations to train multiple parallel KS-ANN models for predicting the unconfined compressive strength (${{{{{\rm{\sigma }}}}}}$), porosity (P), and coefficient of permeability(K), respectively.

Figure 2 presents the mean and variance values of the unconfined compressive strength (σ), porosity (P), and coefficient of permeability (K). The figure shows that the MCMC simulation results incorporate the statistical distribution characteristics of different physical indicators. By using the MCMC simulation results to train a kriging-based surrogate model, a kriging-based surrogate model with desired statistical distribution characteristics is obtained accordingly. The results from the kriging-based surrogate model are ultimately used to train an ANN model. Therefore, the predicted results of the trained ANN model exhibit the discrete distribution characteristics of different physical indicators, which are consistent with the statistical characteristics reflected in the laboratory results.

**Fig. 2: Mean and variance results of the samples used for the Markov chain Monte Carlo (MCMC) simulations.**

In MCMC simulations, for each individual condition, 60,000 discrete data points are simulated, and the first 10,000 data points are discarded due to poor convergence. In this study, nine different combinations of conditions were considered, resulting in the simulation of 450,000 data points. Each condition included three physical indicators, resulting in the simulation of a total of 1,350,000 data points. Figure 3a–i present the MCMC simulation results. To make the labels in the figures clearly visible, we randomly selected a small number of MCMC generated data points for plotting. It is worth noting that the randomly selected data points represent the spatial distribution characteristics of all the data. As shown in Fig. 3a, the 5-100 (denoting cement content of 5% and compaction force of 100 kN) sample exhibits relatively small variability in terms of the unconfined compressive strength (σ), porosity (P), and coefficient of permeability (K). For samples 5-150 and 5-200 (see Fig. 3b, c), the variability in the spatial distribution of porosity (P) is significantly greater than that of the coefficient of permeability (K) or unconfined compressive strength (σ), with a data distribution that appears elongated. Furthermore, the samples with cement content of 10% (i.e., 10-100, 10-150, and 10-200, as shown in Fig. 3d–f) exhibit the same data distribution characteristics. However, for the samples with cement content of 20% (i.e., 20-100, 20-150, and 20-200, as shown in Fig. 3g–i), the variability in the spatial distribution of the unconfined compressive strength (σ) under different compaction forces is significantly greater than that of the porosity (P) or coefficient of permeability (K), with more significant differences observed for samples 20-150 and 20-200.

**Fig. 3: Markov Chain Monte Carlo (MCMC) simulation and Kriging-based surrogate (KS) model prediction results.**

Prediction results of the KS model

By using the proposed KS model, training was performed with the unconfined compressive strength (σ), porosity (P), and coefficient of permeability (K) data by inputting parameter ${{{{{{\boldsymbol{x}}}}}}}_{{{{{{\boldsymbol{i}}}}}}}$, thereby obtaining the results shown in Fig. 3j–l. The figure shows that the spatial distribution pattern of the unconfined compressive strength (σ) (Fig. 3j) is opposite to that of the porosity (P) (Fig. 3k) or coefficient of permeability (K) (Fig. 3l), with porosity exhibiting the highest degree of variability. Specifically, as the unconfined compressive strength (σ) increases, the porosity (P) and permeability coefficient (K) decrease. Finally, the KS model data were standardized and input into an ANN model for training, which resulted in reliable weight files that can be used to predict the unconfined compressive strength (σ), porosity (P), and coefficient of permeability (K) of PCBMs under different conditions (or in-situ compacted conditions).

Prediction results of the KS-ANN models

For a KS-ANN model, the greatest challenge is to determine the values of the parameters in the hidden layers. Herein, the trial and error strategy was used to continuously adjust the parameters in the hidden layers to minimize the prediction error of the KS-ANN model. After fine-tuning these parameters, the number of hidden layers in the ANN model was determined to be 3, with each hidden layer containing 128 neurons.

To evaluate the robustness of the KS-ANN model, a test dataset was created using the mean values of various physical indicators output by the KS model (see Fig. 4b). Additionally, another testing dataset was generated using discrete data of different physical indicators output by the KS model (see Fig. 4a). These testing datasets were used to verify the performance of the KS-ANN model. The results in Fig. 4b demonstrate that the KS-ANN model accurately predicts the mean of the data output based on the KS model, with a coefficient of determination (R²) of 0.99. These findings confirm the reliability of the unbiased estimation of the KS model in surrogate modeling. Moreover, the results in Fig. 4a indicate that the KS-ANN model leads to an R² value of 0.94 when predicting the discrete distribution of data, further supporting the robustness of the KS-ANN model.

**Fig. 4: Prediction results of the ANN/KS-ANN model.**

Figure 4c–e present the individual prediction results of the unconfined compressive strength (σ), porosity (P), and coefficient of permeability (K), respectively. Figure 4d shows that the porosity (P) has the lowest coefficient of determination (0.88), while the coefficient of permeability (K) has a coefficient of determination of 0.99 (Fig. 4e), exhibiting the best prediction result. The coefficient of determination of the unconfined compressive strength (σ) is somewhere in between (0.93) (Fig. 4c). In PCBMs, the unconfined compressive strength (σ) and coefficient of permeability (K) are direct indicators for evaluating whether or not the material meets the design requirements, and they can be accurately predicted by the well-trained KS-ANN model to optimize the design and compaction scheme.

To further verify the robustness of the KS-ANN model, we randomly selected 30% of the data obtained from laboratory tests as a supplementary test dataset to verify the prediction accuracy of the KS-ANN model. Note that this portion of data was intentionally excluded for use in the training datasets of any ANN model, i.e., it was solely used as an additional verification dataset. Figure 4g, h show the prediction results obtained by both the traditional ANN model and the proposed KS-ANN model for such a portion of laboratory test data. It can be seen from Fig. 4g that the coefficient of determination for the traditional ANN model is only 0.76; on the other hand, the proposed KS-ANN model leads to a coefficient of determination of 0.92 (see Fig. 4h), thus indicating a prediction accuracy improvement of 21% as compared to that of the traditional ANN model. This confirms that the use of the KS algorithm in the ANN model significantly improves the prediction accuracy. The proposed method makes it possible to apply machine learning in small-size databases, which has the potential to reduce experimental costs and enhance material design optimization.

Data-driven design optimization concept of the PCBM

A balance between the unconfined compressive strength and coefficient of permeability is a prerequisite for the widespread applications of PCBMs. Although the design method stemming from cement-stabilized dense-graded base materials may improve the strength of PCBMs, their coefficient of permeability may not meet the requirements. Therefore, when either laboratory sample preparation or on-site compaction is employed, PCBMs should simultaneously meet the requirements of the unconfined compressive strength and coefficient of permeability. Additionally, design optimization through methods such as orthogonal design in laboratory sample preparation or on-site compaction is time-consuming and costly. To address these issues, the proposed method predicts the unconfined compressive strength and coefficient of permeability under different working conditions, including sample preparation conditions that cannot be achieved in laboratory tests. Based on this, a design optimization method is proposed for PCBMs with the unconfined compressive strength and coefficient of permeability targeted. The specific implementation procedure is described as follows:

(1)
Based on the correspondence between laboratory sample preparation and field compaction conditions, the random variables ${{{{{{\boldsymbol{x}}}}}}}_{{{{{{\boldsymbol{i}}}}}}}=[{C}_{i},\, {F}_{i}]$ are determined, which are directly related to the unconfined compressive strength and coefficient of permeability. A small number of orthogonal experiments are designed to obtain the physical indicators ${{{{{{\boldsymbol{Y}}}}}}}_{{{{{{\boldsymbol{i}}}}}}}$ [i.e., the unconfined compressive strength (σ), porosity (P), and coefficient of permeability (K)] under different random variables ${{{{{{\boldsymbol{x}}}}}}}_{{{{{{\boldsymbol{i}}}}}}}$, which serve as input parameters for the KS-ANN model.
(2)
To obtain the optimal sample preparation or compaction scheme, a performance function is established for the optimization scheme. Typically, the unconfined compressive strength (σ) and coefficient of permeability (K) are selected as performance evaluation indicators of laboratory sample preparation or field compaction results. The expression of the performance function is as follows:

$$\left\{\begin{array}{c}\sigma \left({{{{{{\boldsymbol{x}}}}}}}_{{{{{{\boldsymbol{i}}}}}}}\right)\ge {\sigma }_{\min }\\ K\left({{{{{{\boldsymbol{x}}}}}}}_{{{{{{\boldsymbol{i}}}}}}}\right)\ge {K}_{\min }\end{array}\right.$$

(1)

where ${\sigma }_{\min }$ and ${K}_{\min }$ are the minimum values specified by related specifications or regulations⁵⁴, which are preselected as 3.5 MPa and 0.5 ${{{{{\rm{mm}}}}}}\cdot {{{{{{\rm{s}}}}}}}^{-1}$ in this study, respectively.

Design optimization function for PCBMs

Figure 5a–b illustrate the prediction results of the KS-ANN model for the unconfined compressive strength (σ) (Fig. 5a) and coefficient of permeability (K) (Fig. 5b) over a wide range, respectively. As shown in the figure, the distribution characteristics of the unconfined compressive strength (σ) and coefficient of permeability (K) are consistent with the patterns observed in laboratory tests. To obtain an optimized function, data points meeting the performance function over a wide range need to be obtained. Therefore, the optimization function aims to find the contour that meets the performance function in the matrix plot of Fig. 5a–b, which is the desired design optimization function.

**Fig. 5: Design optimization process of PCBM corresponding to actual engineering applications.**

In order to verify the predictive performance of the KS-ANN model against external datasets beyond the training datasets used, 9 replicate specimens were prepared in the laboratory with a cement content of 10% and a compaction force of 300 kN for conducting unconfined compression tests. Figure 4f shows the comparison between the unconfined compressive strength (σ) results predicted by the KS-ANN model and the experimental measurements. Consistent with the experimental results, the KS-ANN model utilized nine different random numbers to predict the unconfined compressive strength (σ). It is evident from Fig. 4f that the predicted results of the KS-ANN model exhibit dispersity characteristics that are similar in magnitude and trend to the laboratory test results. The average value of the results predicted by the KS-ANN model was approximately 9.54 MPa, while the average value of the laboratory test results was approximately 9.56 MPa. The difference was only 0.02 MPa. Therefore, this finding verifies the robustness of the KS-ANN model.

Figure 5c, d display both the construction and solving processes of the design optimization function. First, a self-developed Python® program is used to solve the points satisfying Eq. (1) on the matrix graph of the predicted results of the unconfined compressive strength (σ) and coefficient of permeability (K). Then, the equivalent points are projected onto the same matrix graph (Fig. 5c). The obtained equivalent points are retained on the matrix graph, which is converted into the regular Cartesian coordinate system. By fitting the equivalent points, the desired design optimization function can be obtained (Fig. 5d). According to Fig. 5d, the ideal optimal region is bounded by the optimization function and the coordinate axes. For instance, when the compaction force is 300 kN, the maximum acceptable cement content is 13.5%. However, in actual engineering practice, the cement content of PCBMs is typically not less than 3%. Besides, actual PCBMs need to achieve a suitable degree of compaction, that is, a compaction force of 300 kN is often needed. Therefore, the optimal cement content range that can simultaneously meet the requirements of the unconfined compressive strength (σ) and coefficient of permeability (K) is between 3% and 13.5%.

Through the above process, an optimization function denoted by Eq. (2) was formulated. Given the specified cement content, it becomes feasible to anticipate the range of compaction force that fulfills both the unconfined compressive strength and coefficient of permeability simultaneously. It is important to bear in mind that during the practical compaction process of PCBMs, the minimum compacting force, ${F}_{\min }$, generally exceeds 100 kN, while the maximum value, ${F}_{\max }$, typically does not exceed 300 kN.

$$\left\{\begin{array}{c} {F}_{\min }=228.61+1.57C-1.27{C}^{2}\,{R}^{2}=0.99 \hfill \\ {F}_{\max }=10045.3-1706.09C+98.95{C}^{2}-1.93{C}^{3}\,{R}^{2}=0.96\end{array}\right.$$

(2)

where ${F}_{\min }$ and ${F}_{\max }$ are the minimum and maximum compaction forces (in unit of kN), respectively; and $C$ is the designed cement content (in unit of %).

After obtaining the optimization functions of PCBM, with the assumption that the cement content of PCBM is designed to be 10% in related engineering practices, the range of compaction force demanded by Eq. (1) can be calculated as [117.31, 300.00] according to Eq. (2). This indicates that the minimum compaction force should not be lower than 117.31 kN, while the upper limit of the compaction force is determined to be 300 kN by the on-site construction process. Moreover, laboratory specimens with a cement content of 10% were prepared using a compaction force of 300 kN to validate the effectiveness of the optimization functions. Since the unconfined compressive strength results in Fig. 4f confirm the validity and credibility of the developed KS-ANN model, the KS-ANN model is then utilized to predict the unconfined compressive strength and coefficient of permeability of laboratory specimens with a cement content of 10% and a compaction force of 300 kN. As shown in Fig. 5e, f, the mean value of the unconfined compressive strength predicted by the KS-ANN model is approximately 9.54 MPa, whereas the mean value of the coefficient of permeability is approximately 1.11 ${{{{{\rm{mm}}}}}}\cdot {{{{{{\rm{s}}}}}}}^{-1}$. These values satisfy the functionality and performance requirements. The above results confirm the effectiveness of the developed optimization functions and corresponding strategy, which could guide the design optimization and compaction scheme determination for PCBMs.

In this study, a new KS-ANN model is proposed and its robustness is verified. The well-trained KS-ANN model was then used to predict the unconfined compressive strength (σ) and coefficient of permeability (K) of PCBMs over a wide range. This study provides technical references for material design optimization and laboratory or in-situ compaction scheme determination. The main findings are as follows.

(1)
The proposed method effectively integrates real laboratory tests, mathematical modeling, and ML methods, making full use of all available information (which is often limited in reality). Compared to traditional ANN models, the proposed KS-ANN model improved the prediction accuracy by 21% for samples with small datasets.
(2)
The trained KS-ANN model exhibited high accuracy as indicated by the coefficient of determination (R²) of 0.99 while predicting the mean of the KS model, as well as by the average accuracy (R² = 0.94) while predicting the discrete distribution features of the unconfined compressive strength (σ), porosity (P), and coefficient of permeability (K). The trained KS-ANN model can be used to accurately predict the macroscopic hydromechanical properties of PCBMs, allowing for cost-effective optimization of its design and compaction strategies.
(3)
Based on the prediction results of the KS-ANN model, an optimized function that can simultaneously satisfy the requirements of the unconfined compressive strength (σ) and coefficient of permeability (K) was obtained, which can accurately calculate the optimal cement content and compaction force ranges of PCBMs. The proposed method can significantly reduce the experiment cost and provide a heuristic theoretical reference for predicting key hydromechanical properties, design optimization, and engineering practices for similar materials.
(4)
The proposed method uses the data-enriching technique assisted by a kriging-based surrogate model, which is highly beneficial for enhancing the predictive accuracy and efficiency of the ANN model. However, to extend the use of this proposed method in other engineering applications, the input data must satisfy the requirements of the kriging-based surrogate models, i.e., they cannot exceed 4 dimensions. This is the major limitation of this proposed method, which is being tackled in the research efforts currently underway.

Methods

Framework of the KS-ANN model

Kriging-based surrogate (KS) models advantageously afford highly accurate predictions within a domain, even with a limited number of samples. However, according to the previous studies by Zeng et al.¹⁴ and Sun et al.¹⁵, the prediction results of the KS model may not converge outside the domain of the real data or outside the boundary. Therefore, real data from the entire domain need to be obtained when practically applying KS models, which can be challenging under strict laboratory testing conditions. In contrast, ANN models not only exhibit satisfactory prediction accuracy within specific domains but also yield trustworthy prediction results when applied to domains reasonably beyond their initial training set. However, disadvantageously, ANN models require substantial amounts of sample data to form the training set, which inevitably results in high costs and requires significant effort and time. By combining the advantages of KS models and ANN models, accurate data prediction across the entire domain can be realized with a small number of laboratory test samples, thereby compensating for the deficiencies of laboratory tests and reducing the cost.

The proposed KS-ANN framework for obtaining reliable prediction accuracy with a limited number of data samples is shown in Fig. 6. First, laboratory tests are conducted to observe the hydromechanical performance parameters of the PCBM under different combinations of testing conditions (Fig. 6a–c). The proposed combination of testing conditions considered herein includes the cement content of the sample and the static compaction force during the sample preparation, corresponding to the load applied by the roller compactor in the field construction sites. The obtained hydromechanical performance parameters are the unconfined compressive strength (σ), porosity (P), and coefficient of permeability (K). Then, the statistical distribution models of each hydromechanical performance parameter under the specific operating conditions are constructed to replace their actual observation results (Fig. 6d), and the Markov chain Monte Carlo (MCMC) method is used to simulate abundant data samples, which completely agree with the statistical distribution models and their metrics of each hydromechanical performance parameter (Fig. 6e). Subsequently, the simulation results are used to train a reliable KS model. In the KS model, numerous virtual combinations of working conditions are interpolated, and the trained KS model is used to predict hydromechanical performance parameters under different combinational conditions (Fig. 6f). Subsequently, a large number of high-precision virtual samples distributed within the domain are obtained. Finally, the obtained output data of the KS model are used as a training dataset for the ANN model to train a reliable weight model for predicting the hydromechanical performance indicators of the PCBM, including the uniaxial compressive strength (σ), porosity (P), and coefficient of permeability (K). Hence, it is necessary to construct a multi-output artificial neural network (ANN) model to meet the prediction needs of PCBMs (Fig. 6g). Additional details can be found in related work⁴⁸.

Fig. 6: Proposed framework for using a kriging-based surrogate data-enriching artificial neural network (KS-ANN) for improving the prediction accuracy of the strength and coefficient of permeability of PCBM.

Kriging-based surrogate (KS) model

A substantial number of data samples is a fundamental prerequisite for successful ANN model training. However, obtaining a significant amount of data may involve tens of thousands of repetitive laboratory tests, which is challenging in practical engineering applications. Therefore, this study proposes the use of KS models to train the relationship between random input variables ${{{{{{\boldsymbol{x}}}}}}}_{{{{{{\boldsymbol{i}}}}}}}=[{C}_{i},{F}_{i}]$ and the hydromechanical performance indicators (${{{{{\boldsymbol{Y}}}}}}$) [i.e., the unconfined compressive strength (σ), porosity (P), and coefficient of permeability (K)] under different operating conditions. A trained KS model can be used to effectively predict the hydromechanical performance indicators under different laboratory test conditions within the domain.

KS models are semiparametric surrogate models based on statistical theory^18,56,57,58. They accurately predict data samples within a domain, thereby providing additional predicted samples within the domain that have the same physical significance as real samples. Typically, the response function of KS models comprises parametric linear regression and nonparametric stochastic processes^14,15,59,60:

$$\hat{\Gamma }\left({{{{{\boldsymbol{x}}}}}}\right)={{{{{{\boldsymbol{f}}}}}}{{{{{\boldsymbol{(}}}}}}{{{{{\boldsymbol{x}}}}}}{{{{{\boldsymbol{)}}}}}}}^{T}{{{{{\boldsymbol{\beta }}}}}}+{{{{{\rm{g}}}}}}({{{{{\boldsymbol{x}}}}}})$$

(3)

where $\hat{\Gamma }\left({{{{{\boldsymbol{x}}}}}}\right)$ is the predicted value of a kriging-based model for the response function $\Gamma \left({{{{{\boldsymbol{x}}}}}}\right)$. ${{{{{\boldsymbol{f}}}}}}\left({{{{{\boldsymbol{x}}}}}}\right){{{{{\boldsymbol{=}}}}}}{[{f}_{1}\left({{{{{\boldsymbol{x}}}}}}\right),\, {f}_{2}\left({{{{{\boldsymbol{x}}}}}}\right),\, {f}_{3}\left({{{{{\boldsymbol{x}}}}}}\right),\ldots {f}_{k}\left({{{{{\boldsymbol{x}}}}}}\right)]}^{T}$ is a vector of basic functions. ${{{{{\boldsymbol{\beta }}}}}}=[{\beta }_{1},\,{\beta }_{2},\,{\beta }_{3},\ldots {\beta }_{n}]$ is a vector of trend coefficients. ${{{{{\rm{g}}}}}}({{{{{\boldsymbol{x}}}}}})$ is a random Gaussian process with a mean of 0 and a variance of ${\delta }^{2}$, where ${\delta }^{2}$ is the variance of the input data. The unbiased estimator ($E(\hat{\Gamma }\left({{{{{\boldsymbol{x}}}}}}\right)-\Gamma \left({{{{{\boldsymbol{x}}}}}}\right))=0$) guarantees the stability of the random Gaussian process. Here, the covariance of any two points ${{{{{{\boldsymbol{x}}}}}}}_{i}$ and ${{{{{{\boldsymbol{x}}}}}}}_{j}$ can be calculated as follows:

$${Cov}[{{{{{\rm{g}}}}}}\left({{{{{{\boldsymbol{x}}}}}}}_{i}\right),\,{{{{{\rm{g}}}}}}({{{{{{\boldsymbol{x}}}}}}}_{j})]={\delta }^{2}R({{{{{\boldsymbol{\theta }}}}}}{{{{{\boldsymbol{,}}}}}}\,{{{{{{\boldsymbol{x}}}}}}}_{i},\,{{{{{{\boldsymbol{x}}}}}}}_{j})$$

(4)

where $R({{{{{\boldsymbol{\theta }}}}}}{{{{{\boldsymbol{,}}}}}}\,{{{{{{\boldsymbol{x}}}}}}}_{i},\,{{{{{{\boldsymbol{x}}}}}}}_{j})$ represents the correlation function and θ is a predefined model parameter. Typically, the correlation function includes linear, spherical, and Gaussian models. The Gaussian model employed in this study is defined as:

$$R\left({{{{{\boldsymbol{\theta }}}}}}{{{{{\boldsymbol{,}}}}}}\,{{{{{{\boldsymbol{x}}}}}}}_{i},\,{{{{{{\boldsymbol{x}}}}}}}_{j}\right)={\prod}_{p=1}^{m}\exp \left(-{\theta }_{p}{d}_{p}^{2}\right),\,p=1,2,3,\ldots,\, m$$

(5)

where ${\theta }_{p}$ represents the p-th component of ${{{{{\boldsymbol{\theta }}}}}}$. ${d}_{p}={x}_{{ip}}-{x}_{{jp}}(p={{{{\mathrm{1,2,3}}}}},\ldots,m)$ is the distance between the p-th coordinates of the sample points ${{{{{{\boldsymbol{x}}}}}}}_{i}$ and ${{{{{{\boldsymbol{x}}}}}}}_{j}$. $m$ represents the dimensionality of the coordinates.

Consider a dataset for training denoted as ${{{{{{\boldsymbol{x}}}}}}=[{{{{{{\boldsymbol{x}}}}}}}_{1},\, {{{{{{\boldsymbol{x}}}}}}}_{2},\, {{{{{{\boldsymbol{x}}}}}}}_{3},\ldots,\, {{{{{{\boldsymbol{x}}}}}}}_{n}]}^{T},\, ({{{{{{\boldsymbol{x}}}}}}}_{i}=\left[{x}_{i1},\,{x}_{i1},\,{x}_{i1},\,\ldots,\,{x}_{i1}\right]{i}={{{{\mathrm{1,2,3}}}}},\ldots,\, n)$ and a performance response function denoted as ${{{{{\boldsymbol{\Gamma }}}}}}={[\Gamma \left({{{{{{\boldsymbol{x}}}}}}}_{1}\right),\, \Gamma \left({{{{{{\boldsymbol{x}}}}}}}_{2}\right),\, \Gamma \left({{{{{{\boldsymbol{x}}}}}}}_{3}\right),\ldots,\, \Gamma \left({{{{{{\boldsymbol{x}}}}}}}_{N}\right)]}^{T}$. Then, the maximum likelihood estimation for ${{{{{\boldsymbol{\beta }}}}}}$ and ${\delta }^{2}$ can be obtained as follows:

$$\left\{\begin{array}{l}{\hat{\delta }}^{2}=\frac{1}{m}{\left({{{{{\boldsymbol{\Gamma }}}}}}-{{{{{\bf{F}}}}}}\hat{{{{{{\boldsymbol{\beta }}}}}}}\right)}^{T}{{{{{{\boldsymbol{R}}}}}}}^{-1}({{{{{\boldsymbol{Y}}}}}}-{{{{{\boldsymbol{F}}}}}}\hat{{{{{{\boldsymbol{\beta }}}}}}})\\ \hat{{{{{{\boldsymbol{\beta }}}}}}}={({{{{{{\boldsymbol{F}}}}}}}^{T}{{{{{{\boldsymbol{R}}}}}}}^{-1}{{{{{\boldsymbol{F}}}}}})}^{-1}{{{{{{\boldsymbol{F}}}}}}}^{T}{{{{{{\boldsymbol{R}}}}}}}^{-1}{{{{{\boldsymbol{\Gamma }}}}}} \hfill \end{array}\right.$$

(6)

where ${{{{{\bf{F}}}}}}{{{{{\boldsymbol{=}}}}}}{{{{{{\boldsymbol{[}}}}}}{{{{{{\boldsymbol{f}}}}}}}^{T}\left({{{{{{\boldsymbol{x}}}}}}}_{1}\right){{{{{\boldsymbol{,}}}}}} \; {{{{{{\boldsymbol{f}}}}}}}^{T}\left({{{{{{\boldsymbol{x}}}}}}}_{2}\right){{{{{\boldsymbol{,}}}}}} \; {{{{{{\boldsymbol{f}}}}}}}^{T}\left({{{{{{\boldsymbol{x}}}}}}}_{3}\right){{{{{\boldsymbol{,}}}}}} \ldots {{{{{\boldsymbol{\ldots }}}}}}{{{{{\boldsymbol{,}}}}}} \; {{{{{{\boldsymbol{f}}}}}}}^{T}\left({{{{{{\boldsymbol{x}}}}}}}_{N}\right){{{{{\boldsymbol{]}}}}}}}^{T}$ is a regression coefficient matrix. ${R}_{{ij}}=R({{{{{\boldsymbol{\theta }}}}}}{{{{{\boldsymbol{,}}}}}}\,{{{{{{\boldsymbol{x}}}}}}}_{i},\,{{{{{{\boldsymbol{x}}}}}}}_{j})\,\left(i,\; j=1,2,3,\ldots,\; N\right)$ is a correlation matrix. ${{{{{\boldsymbol{Y}}}}}}$ is the target value of dataset ${{{{{\boldsymbol{x}}}}}}$. The parameter ${{{{{\boldsymbol{\theta }}}}}}$ of the model can be obtained as follows:

$${{{{{\boldsymbol{\theta }}}}}}={\arg \min} {|{{{{{\boldsymbol{R}}}}}}|}^{1/n}{\hat{\delta }}^{2}$$

(7)

In KS models, for an unknown point ${{{{{{\boldsymbol{x}}}}}}}^{u}$, the predicted value of $\hat{\Gamma }\left({{{{{{\boldsymbol{x}}}}}}}^{u}\right)$ can be calculated using a Gaussian random process, wherein $\hat{\Gamma }\left({{{{{{\boldsymbol{x}}}}}}}^{u}\right)$ follows a Gaussian distribution with mean ${\mu }_{\hat{\Gamma }}\left({{{{{{\boldsymbol{x}}}}}}}^{u}\right)$ and variance ${\hat{\delta }}_{\hat{\Gamma }}^{2}\left({{{{{{\boldsymbol{x}}}}}}}^{u}\right)$. This can be expressed as $\hat{\Gamma }\left({{{{{{\boldsymbol{x}}}}}}}^{u}\right) \sim N({\mu }_{\hat{\Gamma }}\left({{{{{{\boldsymbol{x}}}}}}}^{u}\right){{{{{\boldsymbol{,}}}}}}\,{\hat{\delta }}_{\hat{\Gamma }}^{2}\left({{{{{{\boldsymbol{x}}}}}}}^{u}\right))$. The predicted mean value ${\mu }_{\hat{\Gamma }}\left({{{{{{\boldsymbol{x}}}}}}}^{u}\right)$ and the predicted variance ${\hat{\delta }}_{\hat{\Gamma }}^{2}\left({{{{{{\boldsymbol{x}}}}}}}^{u}\right)$ are given as

$$\left\{\begin{array}{l}{\mu }_{\hat{\Gamma }}\left({{{{{{\boldsymbol{x}}}}}}}^{u}\right)={{{{{{\boldsymbol{f}}}}}}}^{T}\left({{{{{{\boldsymbol{x}}}}}}}^{u}\right)\hat{{{{{{\boldsymbol{\beta }}}}}}}+{{{{{{\boldsymbol{r}}}}}}}^{T}\left({{{{{{\boldsymbol{x}}}}}}}^{u}\right){{{{{{\boldsymbol{R}}}}}}}^{-1}({{{{{\boldsymbol{Y}}}}}}-{{{{{\boldsymbol{F}}}}}}\hat{{{{{{\boldsymbol{\beta }}}}}}}) \hfill \\ {\hat{\delta }}_{\hat{\Gamma }}^{2}\left({{{{{{\boldsymbol{x}}}}}}}^{u}\right)={\hat{\delta }}^{2}-\left[{{{{{{\boldsymbol{f}}}}}}}^{T}\left({{{{{{\boldsymbol{x}}}}}}}^{u}\right){{{{{{\boldsymbol{r}}}}}}}^{T}\left({{{{{{\boldsymbol{x}}}}}}}^{u}\right){{{{{{\boldsymbol{R}}}}}}}^{-1}\right]{\left(\begin{array}{cc}{{{{{\bf{0}}}}}} & {{{{{{\boldsymbol{F}}}}}}}^{T}\\ {{{{{\boldsymbol{F}}}}}} & {{{{{\boldsymbol{R}}}}}}\end{array}\right)}^{-1}\left[\begin{array}{c}{{{{{\boldsymbol{f}}}}}}({{{{{{\boldsymbol{x}}}}}}}^{u})\\ {{{{{\boldsymbol{r}}}}}}\left(({{{{{{\boldsymbol{x}}}}}}}^{u})\right.\end{array}\right]\end{array}\right.$$

(8)

where ${{{{{{\boldsymbol{r}}}}}}}^{T}\left({{{{{{\boldsymbol{x}}}}}}}^{u}\right)=[{{{{{\boldsymbol{R}}}}}}\left({{{{{\boldsymbol{\theta }}}}}},\, {{{{{{\boldsymbol{x}}}}}}}^{u},\, {{{{{{\boldsymbol{x}}}}}}}_{1}\right),\, {{{{{\boldsymbol{R}}}}}}\left({{{{{\boldsymbol{\theta }}}}}},\, {{{{{{\boldsymbol{x}}}}}}}^{u},\, {{{{{{\boldsymbol{x}}}}}}}_{2}\right),\,{{{{{\boldsymbol{R}}}}}}\left({{{{{\boldsymbol{\theta }}}}}},\, {{{{{{\boldsymbol{x}}}}}}}^{u},\, {{{{{{\boldsymbol{x}}}}}}}_{3}\right),\ldots,{{{{{\boldsymbol{R}}}}}}\left({{{{{\boldsymbol{\theta }}}}}},\, {{{{{{\boldsymbol{x}}}}}}}^{u},\, {{{{{{\boldsymbol{x}}}}}}}_{N}\right)]$ is the correlation vector between input training points.

KS models aim to construct the most accurate surrogate model with a small amount of training data. Therefore, they require an appropriate sampling strategy⁶¹. The mixed adaptive sampling strategy is widely used due to its advantages of spatial filling and adaptability¹¹. The hybrid adaptive sampling method selects a new sampling point by maximizing the quality parameter ${s}_{j}$ at each candidate sampling point ${{{{{{\boldsymbol{x}}}}}}}_{j}\left(j={{{{\mathrm{1,2,3}}}}},\ldots k,\right.$ where k is the number of candidate sample points. ${s}_{j}$ is calculated as follows:

$${s}_{j}=\frac{{d}_{j}}{\max ({d}_{j})}+\frac{{\hat{\delta }}_{j}^{2}}{\max ({\hat{\delta }}_{j}^{2})}$$

(9)

where ${d}_{j}$ represents the minimum Euclidean distance between the j-th candidate sample point and the current design sample point (including the initial and newly inserted sample points). ${\hat{\delta }}_{j}^{2}$ is the predicted variance at the j-th candidate sample point that is directly provided by a KS model.

Artificial Neural Network (ANN) Model

ANNs simulate the functionality of biological neurons to process various nonlinear information inputs and accurately predict output results after fully learning the input information characteristics, as shown in Fig. 1f. The construction of ANN models comprises three main steps: (1) defining the input and output of the information, (2) training the hidden and output layers of the ANN by calculating weights and continuously reducing errors, and (3) evaluating the ANN accuracy by comparing the predicted and actual values. In this study, the input and output layers are kept fixed. The input layer includes the static compaction force (F) and cement content (C), while the output layer includes the unconfined compressive strength (σ), porosity (P), and coefficient of permeability (K), representing a typical regression problem.

The accuracy of data prediction using an ANN model depends on the quality of the training set, but obtaining large amounts of data from laboratory tests is not economical. Therefore, this study proposes a KS model to reduce the number of experiments and obtain high-quality data with real physical significance for training the ANN model. The proposed learning rate for the ANN model is set as 0.002, the maximum number of iterations is 2000, the convergence error is 0.001, and the ratio of the training set to the test set is 9:1. Before commencing the training process, the input data ${{{{{\boldsymbol{x}}}}}}$ are standardized and the output data are kept unchanged, as is customary. The standardized dataset ${{{{{{\boldsymbol{x}}}}}}}^{{{{{{\boldsymbol{*}}}}}}}$ has a mean value of 0 and a standard deviation of 1. The data are standardized using the following formula:

$${{{{{{\boldsymbol{x}}}}}}}^{{{{{{\boldsymbol{*}}}}}}}=\frac{{{{{{\boldsymbol{x}}}}}}-\mu }{\eta }$$

(10)

where ${{{{{{\boldsymbol{x}}}}}}}^{{{{{{\boldsymbol{*}}}}}}}$ and ${{{{{\boldsymbol{x}}}}}}$ represent the standardized and original input data, respectively. Additionally, $\mu$ and $\eta$ are the mean and variance of data ${{{{{\boldsymbol{x}}}}}}$, respectively.

Moreover, the mean squared error (MSE) between the predicted and actual values is calculated using Eq. (11). The proposed ANN model aims to minimize the MSE during the iterative process, thus achieving high predictive accuracy.

$${MSE}=\frac{1}{N}{\sum}_{i=1}^{N}{({Y}_{i}^{t}-{Y}_{i}^{p})}^{2}$$

(11)

where N represents the quantity of data in the testing dataset. ${Y}_{i}^{t}$ and ${Y}_{i}^{p}$ are the actual and predicted values in the testing dataset, respectively.

To assess the accuracy of the proposed ANN model, the coefficient of determination (${R}^{2}$) is employed to evaluate both the training and prediction datasets to prevent underfitting or overfitting of the ANN model. The coefficient of determination is defined as follows:

$${R}^{2}=\frac{{\sum }_{i=1}^{N}{Y}_{i}^{p}-{\bar{Y}}_{i}^{p}}{{\sum }_{i=1}^{N}{Y}_{i}^{{actual}}-{\bar{Y}}_{i}^{{actual}}}$$

(12)

where ${Y}_{i}^{{actual}}$ represents the actual data of the testing dataset. ${\bar{Y}}_{i}^{p}$ and ${\bar{Y}}_{i}^{{actual}}$ represent the average predicted and actual values of the testing dataset, respectively. The constructed KS-ANN model can promptly and accurately predict the hydromechanical performance indicators of PCBMs under different sample preparation conditions across a wide range.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The data generated in this study are provided in the Supporting Information/Source Data file. Data files for model training and testing generated in this study have been deposited in the public GitHub (https://github.com/YM008/Supporting-Materials.git) without any restrictions. The data that support the plots within this paper and other findings of this study are available from the corresponding authors upon request. Source data are provided in this paper.

Code availability

The Python code to implement the machine learning tasks in this study has been deposited in the public database Zenodo⁶² without any restrictions.

References

Uncuoglu, E. et al. Comparison of neural network, Gaussian regression, support vector machine, long short-term memory, multi-gene genetic programming, and M5 Trees methods for solving civil engineering problems. Appl. Soft Comput. 129, 109623 (2022).
Article Google Scholar
Charrier, M. & Ouellet-Plamondon, C. M. Artificial neural network for the prediction of the fresh properties of cementitious materials. Cem. Concr. Res. 156, 106761 (2022).
Article CAS Google Scholar
Eskandari-Naddaf, H. & Kazemi, R. ANN prediction of cement mortar compressive strength, influence of cement strength class. Constr. Build. Mater. 138, 1–11 (2017).
Article Google Scholar
Pham, V.-N., Do, H.-D., Oh, E. & Ong, D. E. L. Prediction of unconfined compressive strength of cement-stabilized sandy soil in Vietnam using artificial neural networks (ANNs) model. Int. J. Geotech. Eng. 15, 1177–1187 (2021).
Article CAS Google Scholar
Bui, D.-K., Nguyen, T., Chou, J.-S., Nguyen-Xuan, H. & Ngo, T. D. A modified firefly algorithm-artificial neural network expert system for predicting compressive and tensile strength of high-performance concrete. Constr. Build. Mater. 180, 320–333 (2018).
Article Google Scholar
Oezcan, F., Atis, C. D., Karahan, O., Uncuoglu, E. & Tanyildizi, H. Comparison of artificial neural network and fuzzy logic models for prediction of long-term compressive strength of silica fume concrete. Adv. Eng. Softw. 40, 856–863 (2009).
Article Google Scholar
Rehman, F., Khokhar, S. A. & Khushnood, R. A. ANN based predictive mimicker for mechanical and rheological properties of eco-friendly geopolymer concrete. Case Stud. Constr. Mater. 17, e01536 (2022).
Google Scholar
Peng, Y. & Unluer, C. Modeling the mechanical properties of recycled aggregate concrete using hybrid machine learning algorithms. Resour. Conserv. Recycling 190, 106812 (2023).
Article CAS Google Scholar
Gulbandilar, E. & Kocak, Y. Application of expert systems in prediction of flexural strength of cement mortars. Comput. Concr. 18, 1–16 (2016).
Article Google Scholar
Schmidhuber, J. Deep learning in neural networks: An overview. Neural Netw. 61, 85–117 (2015).
Article PubMed Google Scholar
Eason, J. & Cremaschi, S. Adaptive sequential sampling for surrogate model generation with artificial neural networks. Computers Chem. Eng. 68, 220–232 (2014).
Article CAS Google Scholar
Wang, J., Lu, Z. & Wang, L. A novel method for estimating the failure possibility by combining the adaptive Kriging model with the Markov chain simulation. Aerosp. Sci. Technol. 119, 107205 (2021).
Article Google Scholar
Ma, Y.-Z. et al. Adaptive Kriging-based failure probability estimation for multiple responses. Reliab. Eng. Syst. Saf. 228, 108771 (2022).
Article Google Scholar
Zeng, P., Sun, X., Xu, Q., Li, T. & Zhang, T. 3D probabilistic landslide run-out hazard evaluation for quantitative risk assessment purposes. Eng. Geol. 293, 106303 (2021).
Article Google Scholar
Sun, X. et al. From probabilistic back analyses to probabilistic run-out predictions of landslides: a case study of Heifangtai terrace, Gansu Province, China. Eng. Geol. 280, 105950 (2021).
Article Google Scholar
García-Macías, E. & Ubertini, F. Real-time Bayesian damage identification enabled by sparse PCE-Kriging meta-modelling for continuous SHM of large-scale civil engineering structures. J. Build. Eng. 59, 105004 (2022).
Article Google Scholar
Forrester, A. I. J., Sóbester, A. & Keane, A. J. Engineering Design via Surrogate Modelling: A Practical Guide. (Wiley, 2008).
Sacks, J., Schiller, S. B. & Welch, W. J. Designs for Computer Experiments. Technometrics 31, 41–47 (1989).
Article MathSciNet Google Scholar
Su, J., Yu, X., Wang, X., Wang, Z. & Chao, G. Enhanced transfer learning with data augmentation. Eng. Appl. Artif. Intell. 129, 107602 (2024).
Article Google Scholar
Cole, J. M. A Design-to-Device Pipeline for Data-Driven Materials Discovery. Acc. Chem. Res. 53, 599–610 (2020).
Article CAS PubMed Google Scholar
Qu, T., Zhao, J., Guan, S. & Feng, Y. T. Data-driven multiscale modelling of granular materials via knowledge transfer and sharing. Int. J. Plasticity 171, 103786 (2023).
Article Google Scholar
Valikhani, A., Jahromi, A. J., Pouyanfar, S., Mantawy, I. M. & Azizinamini, A. Machine learning and image processing approaches for estimating concrete surface roughness using basic cameras. Computer-Aided Civ. Infrastruct. Eng. 36, 213–226 (2021).
Article Google Scholar
Cai, X., Wu, K., Huang, W., Yu, J. & Yu, H. Application of recycled concrete aggregates and crushed bricks on permeable concrete road base. Road. Mater. Pavement Des. 22, 2181–2196 (2021).
Article CAS Google Scholar
Wang, Z., Zou, D., Liu, T., Zhou, A. & Shen, M. A novel method to predict the mesostructure and performance of pervious concrete. Constr. Build. Mater. 263, 120117 (2020).
Article Google Scholar
Tan, X., Hu, Z., Li, W., Zhou, S. & Li, T. Micromechanical numerical modelling on compressive failure of recycled concrete using Discrete Element Method (DEM). Materials 13, 4329 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Cantero, B. Effect of the recycled aggregate on the performance of the granular skeleton. Mater. J. 117, 113–124 (2020).
Zhao, Z., Wang, S., Ren, J., Wang, Y. I. & Wang, C. Fatigue characteristics and prediction of cement-stabilized cold recycled mixture with road-milling materials considering recycled aggregate composition. Constr. Build. Mater. 301, 124122 (2021).
Article Google Scholar
Vardaka, G., Thomaidis, K., Leptokaridis, C. & Tsimas, S. Use of steel slag as coarse aggregate for the production of pervious concrete. J. sustain. dev. energy water environ. syst. 2, 30–40 (2014).
Article Google Scholar
Zaetang, Y., Wongsa, A., Sata, V. & Chindaprasirt, P. Use of lightweight aggregates in pervious concrete. Constr. Build. Mater. 48, 585–591 (2013).
Article Google Scholar
Xie, X. et al. Mixture proportion design of pervious concrete based on the relationships between fundamental properties and skeleton structures. Cem. Concr. Compos. 113, 103693 (2020).
Article CAS Google Scholar
Sandoval, G. F., Galobardes, I., Teixeira, R. & Toralles, B. M. Comparison between the falling head and the constant head permeability tests to assess the permeability coefficient of sustainable Pervious Concretes. 7 https://doi.org/10.1016/j.cscm.2017.09.001 (Elsevier, 2017).
sandoval, G. F. B., reyes, I. G., Schwantes-Cezario, N., Moura, A. C. & Toralles, B. M. Correlation between Permeability and Porosity for Pervious Concrete (PC). DYNA https://doi.org/10.15446/DYNA.V86N209.77613 (2019).
Article Google Scholar
Sandoval, G. F., Moura, A. C. D., Jussiani, E., Andrello, A. & Toralles, B. M. Proposal of maintenance methodology for pervious concrete (PC) after the phenomenon of clogging. 248 https://doi.org/10.1016/j.conbuildmat.2020.118672 (Elsevier, 2020).
Sandoval, G. F., Galobardes, I., Campos, A. & Toralles, B. M. Assessing the phenomenon of clogging of pervious concrete (Pc): experimental test and model proposition. https://doi.org/10.1016/j.jobe.2020.101203 (2020).
Zhong, R. & Wille, K. Linking pore system characteristics to the compressive behavior of pervious concrete.https://doi.org/10.1016/J.CEMCONCOMP.2016.03.016 (2016).
Deo, O. & Neithalath, N. Compressive behavior of pervious concretes and a quantification of the influence of random pore structure features. Mater. Sci. Eng.: A 528, 402–412 (2010).
Article Google Scholar
Fascetti, A., Ichimaru, S. & Bolander, J. E. Stochastic lattice discrete particle modeling of fracture in pervious concrete. Computer-Aided Civil and Infrastructure Engineering. https://doi.org/10.1111/mice.12816 (2022).
Cavalaro, S. H. P., Blanco, A. & Pieralisi, R. Holistic modelling approach for special concrete: from fresh- to hardened-state. RILEM Tech. Lett. 3, 84–90 (2019).
Article Google Scholar
Pieralisi, R., Cavalaro, S. H. P. & Aguado, A. Discrete element modelling of mechanical behaviour of pervious concrete. Cem. Concr. Compos. 119, 104005 (2021).
Article CAS Google Scholar
Wang, X. et al. Forensic analysis and numerical simulation of a catastrophic landslide of dissolved and fractured rock slope subject to underground mining. Landslides 19, 1045–1067 (2022).
Article Google Scholar
Rodrigues, E. A., Manzoli, O. L., Bitencourt, L. A. G., Bittencourt, T. N. & Sánchez, M. An adaptive concurrent multiscale model for concrete based on coupling finite elements. Computer Methods Appl. Mech. Eng. 328, 26–46 (2018).
Article ADS Google Scholar
Huang, Y., Yang, Z., Zhang, H. & Natarajan, S. A phase-field cohesive zone model integrated with cell-based smoothed finite element method for quasi-brittle fracture simulations of concrete at mesoscale. Computer Methods Appl. Mech. Eng. 396, 115074 (2022).
Article ADS MathSciNet Google Scholar
Pieralisi, R., Cavalaro, S. H. P. & Aguado, A. Advanced numerical assessment of the permeability of pervious concrete. Cem. Concr. Res. 102, 149–160 (2017).
Article CAS Google Scholar
Nguyen, H.-Q., Tran, B.-V. & Vu, T.-S. Numerical approach to predict the flexural damage behavior of pervious concrete. Case Stud. Constr. Mater. 16, e00946 (2022).
Google Scholar
Vu, V.-H., Tran, B.-V., Le, B.-A. & Nguyen, H.-Q. Prediction of the relationship between strength and porosity of pervious concrete: A micromechanical investigation. Mech. Res. Commun. 118, 103791 (2021).
Article Google Scholar
Sumanasooriya, M. S. & Neithalath, N. Pore structure features of pervious concretes proportioned for desired porosities and their performance prediction. Cem. Concr. Compos. 33, 778–787 (2011).
Article CAS Google Scholar
Le, B.-A., Tran, B.-V., Vu, T.-S., Vu, V.-H. & Nguyen, V.-H. Predicting the Compressive Strength of Pervious Cement Concrete based on Fast Genetic Programming Method. Arab J. Sci. Eng. https://doi.org/10.1007/s13369-023-08396-2 (2023).
Article Google Scholar
Zhang, J., Huang, Y., Ma, G., Sun, J. & Nener, B. A metaheuristic-optimized multi-output model for predicting multiple properties of pervious concrete. Constr. Build. Mater. 249, 118803 (2020).
Article Google Scholar
Pieralisi, R., Cavalaro, S. H. P. & Aguado, A. Discrete element modelling of the fresh state behavior of pervious concrete. Cem. Concr. Res. 90, 6–18 (2016).
Article CAS Google Scholar
Martins Filho, S. T., Pieralisi, R. & Lofrano, F. C. Framework to characterize nonlinear flow through pervious concrete. Cem. Concr. Res. 151, 106633 (2022).
Article CAS Google Scholar
Zhao, X., Dong, Q., Chen, X., Han, H. & Zhang, T. Evaluation of fatigue performance of cement-treated composites based on residual strength through discrete element method. Constr. Build. Mater. 306, 124904 (2021).
Article Google Scholar
Dong, Q., Zheng, D., Zhao, X., Chen, X. & Chen, Y. Mesoscale numerical simulation of fracture of cement treated base material during semi circular bending test with discrete element model. Constr. Build. Mater. 261, 119981 (2020).
Article Google Scholar
Xiao, Y. et al. Evaluating gyratory compaction characteristics of unbound permeable aggregate base materials from meso-scale particle movement measured by smart sensing technology. Materials 14, 4287 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Chen, J. H. & Zhang, X. S. Talking about the application of cement stabilized open gradation with crushed stone permeable base. Sci. Technol. 202 https://doi.org/10.19392/j.cnki.1671-7341.2010.17.178 (2010).
Article Google Scholar
Mienye, I. D. & Sun, Y. A survey of ensemble learning: concepts, algorithms, applications, and prospects. IEEE Access 10, 99129–99149 (2022).
Article Google Scholar
Booker, A. Design and analysis of computer experiments. In 7th AIAA/USAF/NASA/ISSMO Symposium on Multidisciplinary Analysis and Optimization (American Institute of Aeronautics and Astronautics, St. Louis, MO, U.S.A., 1998). https://doi.org/10.2514/6.1998-4757.
Wei, N. & Lu, Z. Sequential optimization method based on the adaptive Kriging model for the possibility-based design optimization. Aerosp. Sci. Technol. 130, 107939 (2022).
Article Google Scholar
Zhan, D., Cheng, Y. & Liu, J. Expected improvement matrix-based infill criteria for expensive multiobjective optimization. IEEE Trans. Evol. Computat. 21, 956–975 (2017).
Article Google Scholar
Forrester, A. I. J., Sóbester, A. & Keane, A. J. Multi-fidelity optimization via surrogate modelling. Proc. R. Soc. A. 463, 3251–3269 (2007).
Article ADS MathSciNet Google Scholar
Kennedy, M. Predicting the output from a complex computer code when fast approximations are available. Biometrika 87, 1–13 (2000).
Article MathSciNet Google Scholar
Mo, S., Shi, X., Lu, D., Ye, M. & Wu, J. An adaptive Kriging surrogate method for efficient uncertainty quantification with an application to geological carbon sequestration modeling. Computers Geosci. 125, 69–77 (2019).
Article ADS Google Scholar
Wang, X. M. et al. Kriging-based surrogate data-enriching artificial neural network prediction of strength and permeability of permeable cement-stabilized base. Zenodo https://doi.org/10.5281/ZENODO.10987911 (2024).
Article Google Scholar

Download references

Acknowledgements

This work was jointly supported by the Science Fund for Distinguished Young Scholars of Hunan Province, China (2024JJ2073 to Y.X.), National Natural Science Foundation of China (52178443, 51878673, & 51808577 to Y.X.), the National Key R&D Program of China (2019YFC1904704 to Y. Chen, Z. Li, and Y.X.), and the Fundamental Research Funds for the Central Universities of Central South University, China (2023zzts0405, 2021zzts0227, 2021zzts0223 & 2022ZZTS0744 to X. Wang, M. Wang, and W. Li). The computing resources provided by the High-Performance Computing Center of Central South University are gratefully acknowledged.

Author information

Authors and Affiliations

School of Civil Engineering, Central South University, Changsha, China
Xiaoming Wang, Yuanjie Xiao, Wenqi Li & Meng Wang
Ministry of Education (MOE) Key Laboratory of Engineering Structures of Heavy Haul Railway (Central South University), Changsha, China
Yuanjie Xiao
The Second Xiangya Hospital of Central South University, Changsha, China
Yanbin Zhou
Hunan Communications Research Institute CO., LTD., Changsha, China
Yuliang Chen & Zhiyong Li

Authors

Xiaoming Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yuanjie Xiao
View author publications
You can also search for this author in PubMed Google Scholar
Wenqi Li
View author publications
You can also search for this author in PubMed Google Scholar
Meng Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yanbin Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Yuliang Chen
View author publications
You can also search for this author in PubMed Google Scholar
Zhiyong Li
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Xiaoming Wang: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Software, Validation, Visualization, Writing – original draft, Writing – review & editing. Yuanjie Xiao: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing–review & editing. Wenqi Li: Data curation, Formal analysis, Investigation, Validation, Writing – review & editing. Meng Wang: Data curation, Formal analysis, Investigation, Validation, Writing – review & editing. Yanbin Zhou: Data curation, Methodology, Visualization, Writing – review & editing. Yuliang Chen: Conceptualization, Investigation, Methodology, Supervision, Validation, Writing – review & editing. Zhiyong Li: Conceptualization, Investigation, Methodology, Supervision, Validation, Writing – review & editing.

Corresponding author

Correspondence to Yuanjie Xiao.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Mingjing Fang and Bao-Viet Tran for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wang, X., Xiao, Y., Li, W. et al. Kriging-based surrogate data-enriching artificial neural network prediction of strength and permeability of permeable cement-stabilized base. Nat Commun 15, 4891 (2024). https://doi.org/10.1038/s41467-024-48766-4

Download citation

Received: 07 August 2023
Accepted: 06 May 2024
Published: 07 June 2024
DOI: https://doi.org/10.1038/s41467-024-48766-4

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Indirect prediction of graphene nanoplatelets-reinforced cementitious composites compressive strength by using machine learning approaches

Data-driven prediction on critical mechanical properties of engineered cementitious composites based on machine learning

Estimation of compressive strength of waste concrete utilizing fly ash/slag in concrete with interpretable approaches: optimization and graphical user interface (GUI)

Introduction

Results

Experimental design and results

Correlation analysis of the unconfined compressive strength, porosity, permeability coefficient, compaction force, and cement content of PCBMs

Implementation process and result analysis of the KS-ANN model

Analysis of Markov Chain Monte Carlo (MCMC) simulation results

Prediction results of the KS model

Prediction results of the KS-ANN models

Data-driven design optimization concept of the PCBM

Design optimization function for PCBMs

Methods

Framework of the KS-ANN model

Kriging-based surrogate (KS) model

Artificial Neural Network (ANN) Model

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Supplementary Information

Peer Review File

Reporting Summary

Source data

Source Data

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links