Introduction

In recent years, phthalic acid esters (PAEs) have attracted extensive concern because they are widely used as plasticizers and additives in more than a hundred varieties of products, such as toys, packaging materials and cosmetics. These persistent and toxic organic compounds, which can harm the health of organisms and humans through food-chain transfer and bioaccumulation, are therefore commonly found in various environments. Dimethyl phthalate (DMP), one of the most important and extensively used PAEs, has been measured in many environmental compartments, such as surface water, groundwater, sediments, the atmosphere, aerosol particles and soil1,2,3,4, and is known to be likely to cause dysfunction of the endocrine system, liver and nervous system of humans and animals5,6. DMP has therefore been listed as a priority pollutant by the United States Environmental Protection Agency (USEPA)7, the Ministry of Environmental Protection of the People’s Republic of China8 and the European Union (EU)9.

In the past few years, studies have demonstrated that several PAEs can be biodegraded under aerobic and anaerobic conditions10,11,12,13,14,15, in activated sludge16,17 and in acclimated sludge18,19,20,21. To investigate the degradation mechanism and behavior of DMP in treatment systems, various mathematical models have been proposed. However, owing to the high nonlinearity and complexity of the DMP degradation mechanism, traditional mathematical methods struggle to model and simulate the biodegradation process accurately22.

In recent years, artificial intelligence (AI), which can overcome the restrictions of traditional modeling methods and efficiently approximate any nonlinear process, has been utilized for simulation, prediction and modeling23,24,25. Among AI approaches, the neural network (NN) is the best known and most popular, and has been widely used on account of its universal approximation properties26. Although NNs can be used to forecast the effluent quality parameters of a wastewater treatment process (WWTP), they also have shortcomings, such as a tendency to become trapped in local minima, low learning efficiency, slow convergence and difficulty in extracting the mapping rules27,28.

To overcome the drawbacks of NNs, a great number of hybrid intelligent techniques have been constructed, such as the fuzzy neural network and the wavelet neural network (WNN). WNNs, which combine the advantages of NNs and the wavelet transform (WT), use wavelet functions as the neurons’ activation functions and can be regarded as function-link networks based on wavelet functions. Owing to the good time-frequency localization characteristics of wavelets, wavelet functions are an important tool in function approximation. The learning and memory ability of a WNN is therefore more efficient than that of a conventional NN in terms of network size, convergence rate and accuracy29,30. Nevertheless, the WNN also has a shortcoming31: its mapping rules are difficult to interpret. Interpretability is precisely the advantage of fuzzy logic (FL).

Therefore, by combining the advantages of NN, FL and WT, a novel hybrid intelligent technique, the fuzzy wavelet neural network (FWNN), can be constructed. It makes effective use of the self-learning and memory abilities of NN, the uncertainty-handling capacity of FL and the local-detail analysis of WT to improve approximation accuracy, convergence rate and generalization26. Compared with conventional modeling techniques, the hybrid FWNN thus provides a more powerful way to model, simulate and optimize processes, particularly complex wastewater treatment processes.

In this work, a novel FWNN, which combines the concepts of FL with a WNN, is proposed for modeling and simulating the biodegradation of DMP in an anaerobic-anoxic-oxic (AAO) wastewater treatment process. The degradation and behavior of DMP were investigated, and a degradation model covering biodegradation and sorption was formulated with the proposed FWNN model in order to evaluate the fate of DMP. To avoid trial and error and the impact of random initialization, a hybrid learning algorithm integrating an improved genetic algorithm (GA) and the gradient descent algorithm (GDA) was adopted.

Materials and Methods

Reactor system

As shown in Fig. 1, the AAO treatment system, made of polyethylene, consists mainly of four parts: one anaerobic zone with a volume of 40 litres, one anoxic zone with a volume of 40 litres, three aerobic zones with 160 litres, and one settling zone. Two motor-driven stirrers were employed in the anaerobic and anoxic zones. An air blower supplied oxygen to the microorganisms in the aerobic zones. A peristaltic pump automatically fed the system from the feed tank. The mixed liquor passing through the aerobic zones was recycled to the anoxic zone, and the sludge in the settling zone was returned to the anaerobic zone. The reflux ratios of the mixed liquor and the sludge were the same, both set to 1. Sludge from a sewage treatment plant in Guangzhou was cultivated in the laboratory-scale AAO treatment system with synthetic wastewater as feed. Synthetic wastewater with five different concentrations of DMP (>99% purity, Sinopharm Chemical Reagent Co., Ltd), namely 30, 40, 50, 60 and 80 μg L−1, was used.

Figure 1: Schematic diagram of the AAO system. (a) regulating tank, (b) anaerobic zone, (c) anoxic zone, (d) aerobic zone, (e) settler, (f) computer monitoring system, (g) inlet pump, (h) reflux pump for mixed liquor, (i) return sludge pump, (j) air blower, (k) waste sludge, (l) mixer, (m) signal acquisition for DO, ORP, pH and Q.

To maintain a constant temperature of 25 °C, the working environment of the reactor system was regulated by a temperature control system. Dissolved oxygen (DO) was measured with an online dissolved oxygen meter (D53, HACH), and the DO concentrations in the anaerobic, anoxic and aerobic zones were in the ranges 0 to 0.30 mg L−1, 0 to 0.60 mg L−1 and 2.54 to 5.72 mg L−1, respectively. The mixed liquor suspended solids (MLSS) concentration in the reactor system was controlled at about 3000 mg L−1. The hydraulic retention time (HRT) was adjusted by changing the influent pump flow. Likewise, the sludge retention time (SRT) was adjusted by altering the amount of excess sludge discharged from the bottom of the settling zone. The system was operated continuously for one year. Basic information on the reactor system is given in the Supplementary Information.
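The two retention times follow from simple mass-balance ratios, which the sketch below illustrates. It assumes a total working volume of 240 L (40 L anaerobic, 40 L anoxic and 160 L taken as the combined aerobic volume), uses the standard simplified SRT expression that neglects solids lost in the effluent, and picks hypothetical flow rates and waste-sludge concentrations; none of these operating values are reported in the text, and the function names are illustrative only.

```python
def hrt_hours(total_volume_l, influent_flow_l_per_h):
    """Hydraulic retention time: reactor volume divided by influent flow."""
    return total_volume_l / influent_flow_l_per_h


def srt_days(total_volume_l, mlss_mg_per_l, waste_flow_l_per_d, waste_mlss_mg_per_l):
    """Sludge retention time: sludge mass held in the reactor divided by the
    sludge mass wasted per day (solids in the effluent are neglected)."""
    return (total_volume_l * mlss_mg_per_l) / (waste_flow_l_per_d * waste_mlss_mg_per_l)


# Assumed total working volume: 40 L anaerobic + 40 L anoxic + 160 L aerobic.
VOLUME_L = 40 + 40 + 160

for flow in (10.0, 20.0, 30.0):  # hypothetical influent flows, L/h
    print(f"Q = {flow:4.1f} L/h -> HRT = {hrt_hours(VOLUME_L, flow):4.1f} h")

# MLSS of 3000 mg/L is from the text; the waste stream values are hypothetical.
print(f"SRT = {srt_days(VOLUME_L, 3000.0, 2.0, 9000.0):.1f} d")
```

Raising the influent flow shortens the HRT proportionally, while wasting more sludge per day shortens the SRT, which matches the two adjustment handles described above.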

Gas chromatography (Agilent 7890A, USA) coupled with mass spectrometry (Agilent 5975, USA) (GC-MS) was used to identify DMP and determine its concentration. The detailed detection method was described by Huang et al.32. MLSS was measured according to Standard Methods33.

Fuzzy wavelet neural networks (FWNN)

Structure of the proposed FWNN

Figure 2 shows the structure of the FWNN, which has five layers, uses wavelet functions as the neurons’ activation functions and realizes fuzzy logic rules through a five-layer NN34.

Figure 2: Architecture of the proposed fuzzy wavelet neural network system.

The first layer is the input layer, consisting of a group of processing units that accept the data x1, x2, …, xn fed to the network. In this work, the number of input nodes is 5.

The second layer is the fuzzification layer. In this layer, the input variables from the first layer are translated into fuzzy variables by a Gaussian membership function. The outputs of the layer are given below:

where i indexes the input signals and j indexes the fuzzy rules in the third layer; cij and σij are the center and the spread of the Gaussian function; and Fj(xi) is the membership function of the ith input variable for the jth fuzzy rule.
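A commonly used Gaussian membership function that is consistent with these definitions is written out here as an assumption, since the exact normalization of the spread is not stated:

```latex
F_j(x_i) = \exp\!\left(-\frac{(x_i - c_{ij})^2}{\sigma_{ij}^2}\right)
```

Some formulations place 2σij² in the denominator instead; that choice only rescales the spread parameter and does not change the shape of the function.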

The third layer is the fuzzy rule layer, which performs logical inference based on the fuzzy rules. Multiplication is used as the AND operator. The output of the jth node in this layer is

where μj(x) is the input signal for the next layer and n is the number of fuzzy rules.
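Because multiplication serves as the AND operator, the firing strength of rule j is presumably the product of its membership values over all input variables i. A sketch in the notation above:

```latex
\mu_j(x) = \prod_{i} F_j(x_i)
```

With memberships in the interval (0, 1], a rule fires strongly only when every input lies close to the corresponding center cij.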

The fourth layer is the wavelet network layer, which applies a wavelet transform to the data for denoising. The product of the outputs of layers 3 and 4 is the only input to layer 5. The output of the jth wavelet neuron in this layer is calculated by the following equation.

where wj is the weight between the jth wavelon and the output node, the dilation (scale) parameter aij controls the spread of the wavelet, and the translation (shift) parameter bij determines its central position.

The fifth layer is the output layer. This layer calculates the overall output as the sum of the outputs of the previous layers. In this work, the output is the effluent DMP concentration predicted by the FWNN model.
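The five layers can be tied together in a short forward-pass sketch. Several choices here are assumptions rather than details given in the text: a Mexican-hat mother wavelet, a Gaussian membership with σ² in the denominator, a sum of wavelet responses over the inputs of each wavelon, and an unnormalized output sum. All function and argument names are hypothetical.

```python
import math


def gaussian_membership(x, c, sigma):
    """Layer 2: Gaussian membership of input x for centre c and spread sigma."""
    return math.exp(-((x - c) ** 2) / (sigma ** 2))


def mexican_hat(z):
    """Assumed mother wavelet (the text does not name one)."""
    return (1.0 - z * z) * math.exp(-0.5 * z * z)


def fwnn_forward(x, c, sigma, a, b, w):
    """Forward pass for one sample.

    x: list of n inputs; c, sigma, a, b: n-by-m nested lists indexed [i][j];
    w: list of m output weights, one per rule/wavelon.
    """
    n, m = len(x), len(w)
    y = 0.0
    for j in range(m):
        # Layer 3: rule firing strength, product (AND) of the memberships.
        mu_j = 1.0
        for i in range(n):
            mu_j *= gaussian_membership(x[i], c[i][j], sigma[i][j])
        # Layer 4: weighted wavelet response of the j-th wavelon.
        psi_j = w[j] * sum(
            mexican_hat((x[i] - b[i][j]) / a[i][j]) for i in range(n)
        )
        # Layer 5: accumulate the product of the layer 3 and layer 4 outputs.
        y += mu_j * psi_j
    return y
```

A trained network would supply c, sigma, a, b and w from the learning stage; the sketch only shows how a single input vector flows through the layers.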

Training algorithm to optimize the proposed FWNN

In this work, a hybrid learning algorithm based on the gradient descent algorithm (GDA) and the genetic algorithm (GA) was employed to adjust the parameters of the proposed FWNN, namely the center and width parameters of the Gaussian functions (cij and σij), the dilation and translation parameters of the wavelet functions (aij and bij), and the weights of the wavelet network (wj)35.

The GA was first used to initialize the proposed FWNN, and the GDA was then employed to obtain its optimal parameters. The hybrid learning algorithm has two advantages. First, compared with a single optimization algorithm (GA or GDA), it yields a more stable training process. Second, because of the “similarity” phenomenon in the GA population, combining the GA with the GDA speeds up the convergence of training.
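The two-stage idea (global search for a starting point, then local refinement) can be sketched on a toy problem. This is not the improved GA used in this work: it is a plain GA with truncation selection, arithmetic crossover and Gaussian mutation, the quadratic objective stands in for the FWNN training error, and every name and setting other than the crossover and mutation rates (0.3 and 0.09, as reported in the simulation section) is a hypothetical choice.

```python
import random

TARGET = (1.0, -2.0, 0.5)  # hypothetical optimum of the toy objective


def loss(p):
    """Toy stand-in for the FWNN training error."""
    return sum((v - t) ** 2 for v, t in zip(p, TARGET))


def ga_initialise(rng, dim=3, pop_size=20, generations=30, pc=0.3, pm=0.09):
    """Stage 1: a plain GA that returns the best individual it finds."""
    pop = [[rng.uniform(-5, 5) for _ in range(dim)] for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=loss)
        elite = pop[: pop_size // 2]  # truncation selection keeps the best half
        children = []
        while len(elite) + len(children) < pop_size:
            p1, p2 = rng.sample(elite, 2)
            child = list(p1)
            if rng.random() < pc:  # arithmetic crossover
                child = [(u + v) / 2 for u, v in zip(p1, p2)]
            # Gaussian mutation applied gene by gene with probability pm.
            child = [g + rng.gauss(0, 0.5) if rng.random() < pm else g
                     for g in child]
            children.append(child)
        pop = elite + children
    return min(pop, key=loss)


def gradient_descent(p, lr=0.1, steps=200, eps=1e-6):
    """Stage 2: refine the GA result using central-difference gradients."""
    p = list(p)
    for _ in range(steps):
        grad = []
        for k in range(len(p)):
            up = p[:k] + [p[k] + eps] + p[k + 1:]
            down = p[:k] + [p[k] - eps] + p[k + 1:]
            grad.append((loss(up) - loss(down)) / (2 * eps))
        p = [v - lr * g for v, g in zip(p, grad)]
    return p


rng = random.Random(0)  # fixed seed so the run is repeatable
p_ga = ga_initialise(rng)
p_final = gradient_descent(p_ga)
print("loss after GA :", loss(p_ga))
print("loss after GDA:", loss(p_final))
```

For a real FWNN the gradient would normally be derived analytically through the five layers; the finite-difference version is used here only to keep the sketch short.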

Results and Discussion

Kinetics of DMP degradation

To describe the degradation behavior of DMP in the AAO treatment system accurately, degradation models including biodegradation and sorption were developed according to the Activated Sludge Model (ASM2), based on the fate of DMP described by Huang et al.32,36:

where rh(anaerobic), rh(anoxic) and rh(aerobic) are the anaerobic, anoxic and aerobic hydrolysis process rates, respectively; ηFe and ηNO3 are the anaerobic and anoxic hydrolysis reduction factors; KNO3 and KO2 are the saturation/inhibition coefficients for nitrate and oxygen, respectively; SO2 and Ss are dissolved oxygen and biodegradable substrate; and XH is the heterotrophic biomass.

In addition, because the mixed liquor in each reactor is uniform, the metabolic rate of DMP by microorganisms in the AAO system is uniform. Hence, the degradation rate of DMP in the AAO treatment process can be described by the following equation:

From the above, the model describing the degradation rate of DMP can be simplified:

where the four grouped terms of the model are represented by the variables x, y, k and a, respectively. Equation (8) can therefore be rearranged into a linear formula relating the transformed values

Based on the kinetic model of DMP degradation, the kinetic parameters of DMP for anaerobic, anoxic and aerobic degradation, shown in Tables S1–S3 (Supplementary Information), were determined. Then, according to the linear formula (Equation 9), the kinetic parameters of the models (K, Ks and η) shown in Table 1 were calculated. Table 1 shows that the parameter η of DMP for anaerobic, anoxic and aerobic degradation was 0.68, 0.80 and 1.00, respectively. This is because the mixed liquor passing through the aerobic zones was recycled to the anoxic zone and the sludge in the settling zone was returned to the anaerobic zone, which gave the anaerobic, anoxic and aerobic sludge similar characteristics. Thus, the removal efficiency of DMP in the AAO treatment process was higher.

Table 1 Removal kinetic parameters for the biodegradation of DMP.

To assess the performance of the proposed models, they were used to forecast the DMP removal efficiency in the AAO treatment process. Table 1 shows that the modeling approach gave good predictions: the mean absolute percentage error (MAPE) and root mean squared error (RMSE) of the model were both small, and the average relative error was below 15%. The results indicate that the proposed model, which includes both biodegradation and sorption, can describe the degradation behavior of DMP in the AAO treatment system well.

However, owing to the high nonlinearity and complexity of the DMP degradation mechanism, traditional mathematical methods struggle to model and simulate the biodegradation process accurately. Moreover, because the kinetic parameters of the mechanistic model were very difficult to establish, an artificial intelligence technique, which can overcome the restrictions of traditional modeling methods and efficiently approximate any nonlinear process, was used to model the biodegradation of DMP. Drawing on the self-learning and memory abilities of NN, the uncertainty-handling capacity of FL, the local-detail analysis of WT and the global search of GA, a novel FWNN combining a WNN with a Takagi-Sugeno-Kang (TSK) fuzzy model was built to enhance function approximation accuracy.

Modeling with FWNN

Data collection and preprocessing

The main objective of data preprocessing is to prepare the data in a form suitable for the modeling activities. In this work, the relationship between the degradation of DMP and oxidation-reduction potential (ORP), DO, pH and MLSS was explored. To develop the FWNN model, 50 sets of data were obtained from the AAO process over the whole operating period; 35 sets of measured data were selected as training samples and 15 sets were used as test (forecast) samples. Normalization is one of the most commonly used preprocessing methods for improving model performance, so the data were scaled before being fed to the network for training.
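A minimal sketch of the split and scaling step follows. The text does not say which scaling was used or how the 35 training sets were chosen, so min-max scaling to [0, 1] and a simple first-35/last-15 split are assumptions, and the DO readings are hypothetical placeholder values.

```python
def fit_min_max(column):
    """Return the (min, max) bounds of a training column."""
    return min(column), max(column)


def apply_min_max(column, lo, hi):
    """Map values to [0, 1] using bounds fitted on the training data."""
    if hi == lo:
        return [0.0 for _ in column]
    return [(v - lo) / (hi - lo) for v in column]


def split_samples(samples, n_train=35):
    """First n_train samples for training, the remainder for testing."""
    return samples[:n_train], samples[n_train:]


# Hypothetical DO readings (mg/L); real measurements would replace these.
do_values = [2.54 + 0.06 * k for k in range(50)]

train, test = split_samples(do_values)
lo, hi = fit_min_max(train)
train_scaled = apply_min_max(train, lo, hi)
# Test values outside the training range fall outside [0, 1].
test_scaled = apply_min_max(test, lo, hi)
print(len(train_scaled), len(test_scaled))
```

Fitting the bounds on the training samples only, and reusing them for the test samples, keeps information from the forecast set out of the training stage.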

Development of the FWNN model

In this work, the FWNN model was used to forecast the concentration of DMP. The structure of the FWNN model, shown in Fig. 2, was determined by analyzing the degradation mechanism of DMP in the WWTP. After the initial structure and parameters of the FWNN model were determined, a hybrid learning algorithm integrating an improved GA and the GDA was employed to train the network: the structure and parameters of the FWNN were first optimized by the GA, and the GDA was then employed to update the network parameters.

Simulation results and analysis

In this work, the FWNN-based forecasting model was implemented in MATLAB. The initial population size Npop was 100, the crossover rate Pc was 0.3, the mutation probability Pm was 0.09 and the maximum number of generations was 200. Figure 3 shows the training process of the FWNN: the hybrid algorithm converged rapidly and met the target error quickly. The resulting center and width parameters of the membership functions (cij and σij), dilation and translation parameters of the wavelet functions (aij and bij), and weights of the wavelet network (wj) are shown in Tables 2 and 3.

Figure 3: Training performance of FWNN based on hybrid GA-GDA algorithms.

Table 2 Gaussian function parameters of FWNN.
Table 3 The wavelet layer parameters of FWNN.

The forecasting results of the proposed FWNN for the testing dataset are shown in Fig. 4. Figures 4 and 5 show that the predicted values agree well with the observed values. The performance indexes of the proposed FWNN model for the testing dataset are listed in Table 4. Table 4 shows that the proposed FWNN model achieved a very satisfactory prediction of effluent DMP in the AAO wastewater process. The high R2 of 0.9851 indicates an excellent correlation between the predicted and observed values; that is, the FWNN model explains 98.51% of the total variation. Furthermore, the other performance indexes were all close to zero, which also shows that the developed FWNN model had superior prediction performance with only a small deviation.

Figure 4: Comparison of actual output with values predicted by the FWNN.

Figure 5: Training and testing error curves of the FWNN model.

Table 4 Prediction performance of the FWNN, NN and kinetic models.

Comparison of three models (FWNN, NN, and kinetic model)

In addition, to demonstrate the superiority of the FWNN model, it was compared with the NN and kinetic models. As shown in Table 4, the FWNN model has smaller RMSE (or MSE) and MAPE values and a higher R2. In prediction, the R2, MAPE, RMSE and MSE values of the FWNN were 0.9851, 1.8158%, 0.080 and 0.0064, respectively. In contrast, for the NN model with GA (GA-NN) and the kinetic model, R2 was 0.9361 and 0.9036, MAPE was 5.0182% and 8.1017%, RMSE was 0.1658 and 0.2771, and MSE was 0.02750 and 0.0768, respectively.
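For reference, the four performance indexes can be computed as in the sketch below. The definitions are the standard ones and are assumed to match those used here; in particular, R2 is computed as the coefficient of determination, whereas some studies report the squared correlation coefficient instead. The observed and predicted values are hypothetical.

```python
import math


def mse(obs, pred):
    """Mean squared error."""
    return sum((o - p) ** 2 for o, p in zip(obs, pred)) / len(obs)


def rmse(obs, pred):
    """Root mean squared error, the square root of the MSE."""
    return math.sqrt(mse(obs, pred))


def mape(obs, pred):
    """Mean absolute percentage error, in percent (observations must be non-zero)."""
    return 100.0 * sum(abs(o - p) / abs(o) for o, p in zip(obs, pred)) / len(obs)


def r2(obs, pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_obs = sum(obs) / len(obs)
    ss_res = sum((o - p) ** 2 for o, p in zip(obs, pred))
    ss_tot = sum((o - mean_obs) ** 2 for o in obs)
    return 1.0 - ss_res / ss_tot


# Hypothetical effluent DMP values, for illustration only.
observed = [1.0, 2.0, 3.0, 4.0]
predicted = [1.1, 1.9, 3.2, 3.8]

print("MSE :", mse(observed, predicted))
print("RMSE:", rmse(observed, predicted))
print("MAPE:", mape(observed, predicted))
print("R2  :", r2(observed, predicted))
```

Since the MSE is the square of the RMSE, the reported pairs can be cross-checked directly: 0.080² = 0.0064, 0.1658² ≈ 0.0275 and 0.2771² ≈ 0.0768, all of which agree with the values quoted above.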

Table 4 clearly shows that the FWNN model performs better than the GA-NN and kinetic models; that is, it predicts the effluent DMP more accurately than the GA-NN model and the mechanistic model. The results indicate that the FWNN model has a high ability to extract the dynamic behavior and complex interrelationships among the operating variables of the AAO wastewater treatment process.

Furthermore, compared with the kinetic model and the NN model, the proposed FWNN has the following advantages: 1) the FWNN makes effective use of the self-learning and memory abilities of NN, the uncertainty-handling capacity of FL and the local-detail analysis of WT to improve approximation accuracy, convergence rate and generalization; 2) the FWNN design includes the search for optimal definitions of the parts of the fuzzy rules, the determination of a sufficient number of layers and nodes, the initialization of the structure parameters and the training law; 3) owing to the good time-frequency localization characteristics of wavelets, the FWNN has a better capability for learning and memory and is superior in convergence rate and accuracy; 4) the FWNN is of practical significance for optimizing the operating parameters of the reactor system, simulating the reactor system and enhancing its stability and efficiency.

Conclusions

A novel FWNN for modeling and simulating the biodegradation of DMP in an AAO wastewater treatment process was established on the basis of the mechanistic model. With the self-learning and memory abilities of NN, the uncertainty-handling capacity of FL, the local-detail analysis of WT and the global search of GA, reasonable forecasting performance was achieved. Compared with the NN and kinetic models, the FWNN model has smaller RMSE (or MSE) and MAPE values and a higher R2, and thus performs better. Therefore, the FWNN is an efficient approach for modeling the biodegradation of DMP.

Additional Information

How to cite this article: Huang, M. et al. A New Efficient Hybrid Intelligent Model for Biodegradation Process of DMP with Fuzzy Wavelet Neural Networks. Sci. Rep. 7, 41239; doi: 10.1038/srep41239 (2017).
