Improving short-term water demand forecasting using evolutionary algorithms

Stańczyk, Justyna; Kajewska-Szkudlarek, Joanna; Lipiński, Piotr; Rychlikowski, Paweł

doi:10.1038/s41598-022-17177-0

Download PDF

Article
Open access
Published: 08 August 2022

Improving short-term water demand forecasting using evolutionary algorithms

Scientific Reports volume 12, Article number: 13522 (2022) Cite this article

4862 Accesses
19 Citations
2 Altmetric
Metrics details

Subjects

Abstract

Modern solutions in water distribution systems are based on monitoring the quality and quantity of drinking water. Identifying the volume of water consumption is the main element of the tools embedded in water demand forecasting (WDF) systems. The crucial element in forecasting is the influence of random factors on the identification of water consumption, which includes, among others, weather conditions and anthropogenic aspects. The paper proposes an approach to forecasting water demand based on a linear regression model combined with evolutionary strategies to extract weekly seasonality and presents its results. A comparison is made between the author's model and solutions such as Support Vector Regression (SVR), Multilayer Perceptron (MLP), and Random Forest (RF). The implemented daily forecasting procedure allowed to minimize the MAPE error to even less than 2% for water consumption at the water supply zone level, that is the District Metered Area (DMA). The conducted research may be implemented as a component of WDF systems in water companies, especially at the stage of data preprocessing with the main goal of improving short-term water demand forecasting.

Machine-learning algorithms for forecast-informed reservoir operation (FIRO) to reduce flood damages

Article Open access 21 December 2021

The impact of the number of high temporal resolution water meters on the determinism of water consumption in a district metered area

Article Open access 02 November 2023

Short-term streamflow modeling using data-intelligence evolutionary machine learning models

Article Open access 24 August 2023

Introduction

Water consumption prediction is one of the main goals in managing water supply infrastructure and water resources. Increasing urbanization, industrialization, population growth and unavoidable climate change cause the demand for drinking water to increase constantly¹. Freshwater resources for water supply are constantly decreasing, and although we are demonstrating more and more conscious and pro-ecological attitudes in terms of water saving, studies show significant water scarcity of water globally and locally, especially in countries with dry climate². Forecasting water demand over different time horizons is crucial in the management of water resources. Depending on the purpose of the observations, developed Water Demand Forecasting (WDF) systems allow predicting water consumption in short- and long-term perspectives. The aim of long-term analyses, considering the horizon of 20–30 years, is to support making decisions related to designing and developing water supply systems. Short-term simulations, usually hourly, daily, or weekly^3,4, are used to optimize the work and energy costs of pump stations (Pump Scheduling Optimization, PSO)⁵ and to solve current operational problems⁶. Tiwari and Adamowski⁷ as well as Candelieri et al.⁸ also distinguish medium-term predicting with respect to weekly time ranges. It is used in supply network maintenance and developing failure prevention procedures. Future water consumption is always estimated based on historical time series. The methodology of their analyses consists in identifying the current pattern of water demand and predicting future consumption.

Apart from the purpose of water demand prediction that was mentioned above, its key aim is to ensure the appropriate amount of water to users in a long-term perspective, which is a difficult task, mainly in dry climate countries. This problem is especially intensified in large urban centers with a continuously increasing number of residents^9,10,11. The analysis of the aquifer content that changes in time depending on external factors allows to determine their quantitative potential in terms of future water intake, water treatment and pumping it into the water supply system¹². To be able to assess the sufficiency of water resources, it is necessary to implement procedures that are used in long-term forecasting.

Both short-term and long-term predictions require knowledge of the influence of factors that result in irregularities in water consumption from the water supply network. Short-term predictions require mainly the meteorological parameters analysis, whereas long-term forecasts require additionally analyzing demographic and economic factors¹⁰. The current weather conditions, geographical location, and socio-demographic aspects, referred to as anthropogenic¹¹, determine the existence of hourly, daily, weekly or seasonal variability. The influence of anthropogenic factors is globally enhanced on holidays (Sundays, public holidays, and school holidays), in periods preceding religious holidays and locally, for example, during major cultural and leisure events.

One component of smart cities, an idea that is constantly being implemented and improved as part of the city vision of the future¹³ is Intelligent Water Systems (IWSs). Intelligence in the water sector, according to the Water Environment Federation, is evidenced by the use of advanced technologies in decision making and management. In practice, this means, among other things, the need to reduce operational costs, manage and mitigate risks, which should be served by risk assessment solutions, failure prediction, performance prediction and decision support systems¹⁴. Smart water supply systems take into consideration the variability in water consumption in the prediction sector, for systems of residential water saving¹⁵ and in solutions aimed at detecting abnormal states. Modelling their behavior requires advanced methods because of water supply system complexity¹⁶. The implementation of meteorological data enables to reduce the predicting error even by 11%¹⁷. The knowledge of the meteorological factors is necessary to obtain the best quality of water consumption prediction model¹⁸. Not taking into account the variability in water consumption in failure detecting systems results in an increased number of generated false alarms¹⁹. Including meteorological factors in the input data used for water consumption forecasts is therefore crucial, especially during short- and medium-term analyses^13,20,21.

The purpose of the study was to create a methodology for extracting weekly seasonality in water consumption time series using evolutionary algorithms, which were implemented to improve the daily forecast accuracy. To the best of the authors' knowledge, this study is one of the few attempts to application of the evolutionary approach for water consumption time series implemented to improve prediction accuracy. The paper presents the results of short-term predictions of water consumption in one of the District Metered Area (DMA) zones located in Wroclaw (Poland), characterized by multifamily buildings. The applied models are based on evolutionary algorithms and on standard approach: Support Vector Regression (SVR), Multilayer Perceptron (MLP) and Random Forest (RF). Advanced methods in the field of computational intelligence are used in the research. These methods include evolutionary algorithms that discover seasonality, which is hard to determine when using simple methods due to the additional overlapping of, e.g., the development trends of DMA zones and annual seasonality. The implemented procedure allowed the authors to minimize the MAPE forecast error to even less than 2%, taking into account various case study variants. Regular evolutionary strategies may provide a tool that will support water demand prediction and thus improve the ongoing decision-making processes on the level of water supply networks managed and operating. The presented approach towards forecasting water demand levels, based on applying classical evolutionary strategies, shows that getting to know the factors determining water consumption variability is a key aspect. According to the research²², not only the analyses of meteorological parameters should be implemented in WDF models, but they should also take into consideration the periodicity of water consumption resulting from anthropogenic behavior, for instance, the different property characteristics, various household composition in terms of number and age, socioeconomic level and education level of the consumers. Finally, it may also become a component of systems that warn about abnormal states, such as failures, which are a widely analyzed aspect of managing water supply networks²³.

The first section of the article contains a literature review concerning the methods used to predict water consumption. The following sections present the problem statement of the research, the methods as well as the results of the research conducted by the authors, and their interpretation and discussion. The final section of the paper provides some conclusions.

Literature review of water demand forecasting methods

The optimization of water supply system operation is an important and up-to-date problem in civil engineering concerning increasingly scientific research^24,25. The great progress in predicting water consumption that has been noted in recent years was caused by the intensive development and implementation of algorithms embedded in advanced computational intelligence. The subject of analyses and interpretation are not only historical hydraulic parameters, but also meteorological conditions.

The methods are currently applied for water demand forecasting have been presented in the form of a table (Table 1). Different characteristics for every case study and using various kinds of errors for evaluation cause that it is difficult to compare the model implemented by the authors. For this reason, the research compares the proprietary model with standard methods. Moreover, a review of the literature shows that appropriate techniques are rarely used in data preprocessing. Discrete wavelet transform (DWT) is most often used for this purpose²⁶.

Table 1 Review of the water demand forecasting methodology.

Full size table

The knowledge of the nature of water consumption should also constitute the basis for expert systems warning about failures, so-called Leak Detection Systems (LDS), and identifying the location of leaks. Unfortunately, due to the complexity of these solutions, these issues are often neglected. According to Brentan et al.⁶, short-term prediction is the key aspect of the detection of abnormal states. These states may be identified at the moment when the estimated and observed measurement values of hydraulic parameters differ. As far as this category of applications of WDF is concerned, the ARIMA models and ANN were implemented in the past. Okeya et al.⁴³ supplemented their research on leak detection with the WDF method, thus reducing the number of false alarms warning about potential failures. Wu et al.⁴⁴ developed a solution based on the Cluster Analysis of historical flow rate measurements. According to the authors, clusterization-based methods may be effectively applied in the analysis of nonstationary time series. Apart from that, they also minimize the number of positive false alarms and they may be implemented to detect large flow leaks. To ensure that the application of water consumption prediction systems will improve the results of detecting water supply network failures, short- and medium-term analyses should be conducted based on data recorded at time steps not exceeding 15 min⁸.

Research problem definition

As the literature review in the article indicates, none of the methods that have been used so far is universal enough to be used for water consumption forecasts under all water network operating conditions. The non-stationarity of time series, related to the variable character of water consumption and other factors that affect it, makes it necessary to determine the long-term trend, annual and weekly seasonality. The aforementioned conditions result in the necessity to implement analytical tools connected with the variability in water consumption as part of advanced WDF systems^45,46.

Despite the fact that there are many scientific approaches to the problem of water consumption forecasting, none of them is universal enough to be directly implemented in various operating conditions of water supply networks. It is because of the spatial variability of the weather, the specific operational characteristics of water supply systems and their users. The most frequently mentioned problems related to a reliable comparison of the quality of forecasting models include: different characteristics for every case study, forecasting horizon, size and types of samples, demand periodicity, and various, disproportionate forecast errors^5,18. In addition, creating long-term forecasts is subject to bigger errors than short-term forecasts, i.e., daily forecasts are more reliable than weekly ones⁴⁰.

The key issues raised in 6-year research concern the creation of a reliable forecasting model for water consumption based on measurements carried out on the actual water supply network. The obtained time series are difficult to interpret due to the fact that according to the classification, it is a large water supply company with many inhabitants and additionally students in the October-June/July period, which generates dynamic seasonality. According to the data of the Statistical Office in Wroclaw, approximately 120 thousand students resided in Wroclaw, which accounts for nearly 19% of the whole population⁴⁷. During holidays, most of these students leave the city and return in October to continue education. Additionally, in July and August also permanent residents of Wroclaw are out of town on holiday. This phenomenon was highlighted in the works by Candelieri and Archetti²⁸ and Candelieri et al.⁹ and it justifies considering different characteristics for holiday periods in predicting the consumption of tap water. The analyzed area is specific mainly because while other studies indicate an increased water consumption during the summer period⁴⁸, in Wroclaw, a significant decrease is observed Table 1, presented in the previous section, shows the existence of many analytical approaches used in water demand forecasting. Therefore, an evolutionary approach was proposed to identify seasonality in the time series and the created model was compared with the standard methods (SVR, MLP, RF, and CART).

The input information for short-term forecasting, apart from water consumption, usually describes the weather conditions. Opinions on the impact of individual meteorological parameters on the quality of the model are strongly divided. Some scientists do not take them into account at all^5,37,45,49, although the literature review by Sebri¹⁸ indicates that models of better accuracy were obtained for short-term and medium-term forecasts using meteorological data. Typically, the input base is based on weather data including air temperature, occurrence of precipitation, and air humidity. As far as air humidity is concerned, literature reports are contradictory, because some studies show the greatest dependence of water consumption on air humidity⁵⁰, while Ambrosio et al.²⁰ proved that including this parameter in the input data significantly worsens the prediction quality. The quality changes of the model are analyzed in this research, taking into account various input data.

Comprehensive knowledge of the problem results in improving the prediction systems in the analyzed DMA zones, however, it should be noted that both water consumption and meteorological time series are burdened with noise and errors, so working with their use is not a simple process⁵. Models based on actual measurement data are subject to uncertainty resulting from deficiencies caused, among others, by failure of measuring devices, meanwhile, reliable long-term forecasts are based on long-term measurement sequences³⁸. According to the research⁴², for short-term forecasting, a 4-month measurement series is sufficient to obtain reliable water demand forecasts. Another goal of the research is the analysis of critical events in the operation of the water supply network, which largely contributed to the deterioration of the quality of prediction and the development of weekly periodicity patterns in water demand.

Materials and methods

Study area and datasets preparation

The input data for the model, related both to the hydraulic parameters of water flow inside water supply networks and weather elements, were obtained from the monitoring service of the Municipal Water and Sewage Company S.A. in Wroclaw, MPWiK (Lower Silesian Region, Poland). The water supply network of Wroclaw consists of more than 30 DMAs equipped with more than 80 measurement points. The analyzed period covered the daily time series from September 2014 to May 2020.

Water flow supplied to the area was measured with the use of an electromagnetic flow meter that records data at 10 min time intervals. It is installed in the zone measurement well in the multifamily building DMA. Time series recorded with a time step of 10 min are part of a reporting and network control system. A GSM module is used to transmit data to the water supply network operator in real time. Water is supplied to the discussed DMA through a water supply pipe of the diameter of DN500. The analyzed area is inhabited by more than 22 thousand people. To eliminate additional factors that might influence the prediction, data concerning abnormal situations, resulting, among others, from pipe failures and temporary breaks in data recording, were removed from the time series. During such events, zero pressure and flow are recorded.

The automatic meteorological station that provided the measurements is located in the city center. Similarly as in other large urban and industrial agglomerations, land development and land use lead to an Urban Heat Island forming, which results in the modification of temperature and precipitation in the city. The measured basic meteorological parameters included: air humidity and temperature, atmospheric pressure, wind speed, solar irradiation, and precipitation. The data is sent to a datalogger via a wireless GSM/GPRS connection. Although the automatic measurement station was programmed for sampling at 1-min intervals, the analyses used average values and daily sums of the particular meteorological parameters. This procedure was used because weather conditions measured with such a high-frequency measurement interval are not a factor in the ongoing variability of water use. The variation in minutes of weather is never significant in short-term prediction of water demand⁵¹.

To maintain the consistency of the analyses and the subsequent interpretation of the results, both during the creation of the model using standard methods and the evolutionary approach, the same datasets were used. To avoid the overfitting phenomenon, 10 random datasets were selected from the databases containing information on water demand and meteorological parameters. Each time, the training dataset was 365 days of measurement, while the testing set was 30 successive days. In some experiments, in addition to the data containing water consumption and meteorological parameters, characteristic variables were added to the input dataset. Most often, additional variables contain information about the day of the week, split into working days and weekends⁹. The authors' research included one water consumption of the last two weeks (history0—previous 7 days and history1—7 days before history0) and one hot encoding of the day of the week.

Reference methods

In this article, several standard machine learning algorithms were used. To make this article self-contained, a very brief review of the implemented methods was presented. More details about these methods in general can be found in Murphy⁵² and Goodfellow et al.⁵³.

Support vector regression

The idea of Support Vector Regression is similar to Support Vector Machines, but SVR concerns regression problems, while SVM—classification problems. SVR is a type of regressor, but more tolerant to outliers than regular linear regression as it does not count as error values which are close enough to the correct value. It can model nonlinear dependencies by mapping the original data space to a new data space, usually of a higher dimension, by some nonlinear kernel functions and using the kernel trick, which simplifies the transformation by allowing to express the inner products in the new data space by some precomputed values related to the kernels. In our experiments, various kernels were used, such as the linear kernel, the polynomial kernel, the sigmoid kernel or the RBF kernel, each of them with a different parameter setting leading to the best performance on a selected dataset, and the best results were obtained with the simplest variant (the linear kernel).

A multilayer perceptron

Nowadays, artificial neural networks are one of the most promising and most popular methods in machine learning. They are loosely inspired by biological neural networks, but can be treated as a way of describing a computation of a function values using well-defined, differentiable operations: matrix multiplication and addition, function composition, and some nonlinear functions (such as tanh, or Rectified Linear Units, ReLU). The authors used the popular architecture of a multilayer perceptron with two layers of hidden neurons (as more hidden layers did not lead to significant improvements in the computational experiments, only to overfitting). Several hidden layer sizes were tested in this research.

Linear regression

In this approach, the authors assume that the predicted value is a linear combination of all features (all categorical features, such as days of the week, should be represented by several binary variables, using one-hot encoding). The optimal values of the coefficients can be found using linear algebra methods or gradient-based methods (such as Stochastic Gradient Descent, SGD). LR is as one of the simplest machine learning methods.

Classification and regression trees (CART)

The authors construct a tree which describes in a concise manner what conditions the feature values should meet to give certain output values. CART method allows to describe very precisely the training set, but if it is necessary to achieve good generalization properties, some pruning techniques should be used.

Random Forests

Random Forests are an example of ensemble learning. In these methods, the authors try to combine the results of many rather weak classifiers/regressors. In random forests, CART trees (with limited depth) were used. These trees use different subsets of learning examples and different subsets of features as well. The authors used two depth limits: 5 and 10.

For all these methods, the scikit-learn library for Python⁵⁴ programming language was used.

Evaluating forecast accuracy

The main aim is to minimize the objective function, which is the Mean Absolute Percentage Error (MAPE) (using to evaluate models’ accuracy, among others by Bakker et al.¹⁷; Candelieri and Archetti²⁸; Candelieri et al.⁹; Xu et al.³⁷) of the forecasts obtained with use of machine learning. This kind of error is scale-independent, so it is especially recommended for use in the predictive model evaluation¹⁸. MAPE may be described by the following equation⁵⁵:

$$MAPE=\frac{1}{n}\sum_{t=1}^{n}\frac{\left|{\widehat{y}}_{t}-{y}_{t}\right|}{{y}_{t}}\cdot 100 \%$$

(1)

where: $n$—the size of the sample, ${\widehat{y}}_{t}$—the value predicted by the model for time point $t$, ${y}_{t}$—the value observed at time point $t$.

Evolutionary approach

Most of the prediction models require stationary time series to provide accurate predictions. However, time series of water consumption usually reveal a type of annual and weekly periodicity, which makes the time series not stationary and thus difficult to predict by regular prediction models. The annual periodicity cannot be directly extracted from the data, because it is usually irregular and related to the characteristic of a particular season in a particular year (early or late spring, dry or wet autumn, etc.), but it is strongly related to the weather conditions, so its influence on the prediction accuracy in more complex models can be reduced by adding the data describing current weather conditions. The weekly periodicity is more difficult to reduce, because it is also noised by the irregular annual periodicity.

In our approach, the input data for the model were water consumption and meteorological parameters (Fig. 1). First, the water consumption data (Q), obtained from measuring equipment and resampled to one-day frequency, were preprocessed by extracting anomalies and the global trend (defined by the 365-days moving average). Second, the water consumption data (Q) were expressed as a fraction (Q′) of the 7-days moving average (i.e., the average flow rate from the 7 days preceding each of the analyzed days), so that:

$$Q{^{\prime}}=\frac{Q}{{Q}_{7}}$$

(2)

where: $Q{^{\prime}}$—the current value of water consumption in relation to the previous 7 days average, $Q$—water consumption, ${\mathrm{Q}}_{7}$—the daily average of water consumption in the last 7 days.

Next, the relative water consumption data (Q′) were further preprocessed by extracting a weekly seasonality (evaluated either by the regular approach as a simple average of Mondays, Tuesdays, etc., or by the proposed evolutionary approach described further). Afterwards, the relative water consumption (Q′) was predicted using a linear regression model with the input data consisting of the values of Q′ from the previous 7 days and the weather data from the last day. Predictions were transformed to the original data scale by adding the weekly seasonality and the global trend and compared to the original data to evaluate the MAPE.

In the research, the prediction models were enhanced with evolutionary algorithms that discover the weekly periodicity in the optimization process. From the technical point of view, the weekly periodicity is a real number vector of length 7, and it is defined in a type of learning process as a solution to the optimization problem with the objective function being the accuracy of the prediction over a predefined training dataset and the search space consisting of real number vectors of length 7. To solve the optimization problem, an evolutionary algorithm based on an Evolution Strategy⁵⁶ is proposed. It evolves a population of candidate solutions in the following process: first, the population includes a given number of random candidate solutions (random vectors from the search space drawn from the uniform probability distribution over the search space). Afterwards, the evolution starts and, in each iteration, a given number of parent candidate solutions is selected, then each pair of parent candidate solutions produces a pair of child candidate solutions, and finally, the selected child candidate solution replaces the previous candidate solution in the current population. Parent candidate solutions are selected with the probability proportional to their value of the objective function. Children are produced by local intermediary recombination and mutation operators, as in regular Evolution Strategies. The current population is updated with the best candidate solutions from the union of the current population and the child population. Finally, the evolution terminates after a given number of iterations.

In the experiments, different configuration settings of the evolutionary algorithm were studied to calibrate the approach and the best results were obtained with the parent population size of 200, the offspring population size of 400, the number of iterations of 50 and other parameters set according to the general heuristics for Evolutionary Strategies⁵⁶. Figure 2 presents the general schema of the algorithm.

Consent for publication

All authors have given their permission to publish.

Results and discussion

Reference methods

In the first part of the research that was based on the standard approaches (SVR, MLP, LR, CART, and RF), it was assessed which of them allow to obtain the lowest possible value of the MAPE error, taking into account various variants of the input data. The learning process was carried out for all 10 datasets with information on historical water consumption (history0 and history1) and taking into account the optimal set of features. The implementation of all variables in the learning process could result in overfitting the model, therefore the aim of this part of the research is also to select the optimal set of input data. Since 12 features and extra features are blocked in 3 groups, there were 16,384 combinations and for every such combination the learning and evaluation procedure was performed.

Table 2 presents the results of the preliminary MAPE errors for the models created with the use of linear regression and for example, one dataset, taking into account various input data combinations.

Table 2 Mean absolute percentage error for linear regression and selected input data.

Full size table

Forecast errors ranging from 2.38 to 3.66% were obtained with the use of linear regression models. Out of all possible combinations of input data sets, the model with the best learning quality (MAPE equal to 2.38%) was obtained for the historical data, taking into account the water consumption level from the preceding 7 and 14 days, as well as a set of meteorological data: air humidity, wind speed and soil temperature. Due to the fact that some of the meteorological data are strongly correlated, e.g., soil temperature and air temperature, the use of all data does not improve the learning quality (MAPE error is 2.81%) and can lead to overfitting, even with such a simple approach as linear regression. The study results clearly show that the wind speed and historical values of water consumption are important in each case. Nevertheless, it should be emphasized that the most important factor is including the historical 7 and 14-day time series in the input data. The MAPE error for modelling with and without weather data differs by 0.08% (between the best two features datasets and the best without weekdays datasets) and 0.29% (between the best one feature and the best without weekday datasets). The meteorological parameters are strongly correlated with each other, therefore their selection for forecasting is biased.

Days of the week data were entered as the only input, apart from water consumption for modelling by, e.g., Bakker et al.⁴⁵, where MAPE errors ranging from 1.44 to 5.12% were obtained for 24-h forecasts. Further research results, however, indicate that weather data significantly improve the model accuracy¹⁷. Antunes et al.⁵⁷ based on their research indicate that the most important meteorological parameters when forecasting water consumption are: temperature, rain level, and rain occurrence, whereas Piasecki et al.⁵⁰ claim, it is air humidity. Comparing the research results of other authors is not entirely reliable due to the fact that each of them has a different set of data, which depends on the local capabilities related to the possession of measuring devices as well as the forecasting scope. Pesantez et al.⁵¹ showed that for hourly forecasts, it is not necessary to have weather condition parameters due to their low variability within this time frame, but the research was carried out in relation to the users of the water supply network and not the entire DMA zone. The results of this research indicate that for daily or monthly forecasts, the information on the weather improves the accuracy of the model. The input data should include not only meteorological parameters, but most of all data informing about the historical variability of water consumption (7 and/or 14 days). They contain seasonality and variability caused mainly by human activity and its repetitive behavior in this area.

Due to the fact that the learning process was carried out on 10 fixed data sets, and one of the objectives was to investigate to what extent the selection of optimal features is random, a random forest regressor was used and the feature selection was performed multiple times. It was also checked if the feature selection is generally a good approach by comparing the performance of the regressors with all features and with a selected set. At the same time, it should be noted that the choice was made for one classifier and then tested on another. In the next stage of analyses, a water consumption model was created for the optimal set of features (history0, history1, days of the week, precipitation, wind speed, ground temperature) and the set of all features using methods more advanced than linear regression, i.e. with SVR, MPL and RF. This stage was aimed at indicating which of the methods will minimize MAPE and thus be implemented in the authors' evolutionary approach, with the optimal set of input data already selected. RF-k random forests with depth limited to K, Classification and Regression Trees (CART), MLPk-n Multi-Layer Perception regressor with k and n neurons in two hidden layers, and SVR-linear Support Vector regressor with linear kernel were used. Table 3 presents the obtained values of the MAPE forecast errors for the previously selected optimal variables and all features. Results are averaged over all 10 datasets, detailed results for each are presented in Appendix 1.

Table 3 Forecast mean absolute percentage error comparison for selected methods.

Full size table

The obtained study results indicate that the lowest MAPE error in the forecast of water consumption for all 10 datasets was obtained through Linear Regression and the largest for Multilayer Perceptron. This regularity applies both to the situation where all features were included in the input data and where those selected in the previous step were included. Random Forest method is the second that gives the lowest MAPE error after linear regression. The use of the most optimal set of features in the input model (history0, history1, days of the week, precipitation, wind speed, ground temperature) resulted in obtaining a smaller forecast error for practically most of the methods.

The obtained research results indicate that models based on linear regression are suitable for creating water demand forecasts without the need to implement more advanced solutions. After selecting the optimal set of features, even before implementing the evolutionary approach, which is the main goal of the research, the MAPE error is obtained at a satisfactory level of 2.2%.

Evolutionary approach

This part of the article presents the results of research into the creation of a prognostic model containing implemented evolutionary algorithms, in which 10 selected datasets were used as testing data (similar to those in “Reference methods” section). The training data was a period of 1 year before each of the test datasets. To improve the quality of prediction, the evolutionary approach was implemented in conjunction with Linear Regression as the method through which the lowest MAPE error values were achieved.

Table 4 presents the prediction error for the prediction algorithm used in this experiment with the weekly periodicity vector determined by the evolutionary algorithm. MAPE values obtained during the implementation of the evolutionary approach for each of the ten datasets are presented. It includes both the quality of the forecasts for the training and testing sets.

Table 4 Prediction error for a method containing an evolutionary approach.

Full size table

The MAPE error for the training sets was 2.00%, while for the testing sets it was 2.12%. The results of the research show that the model with the implemented evolutionary approach allows to obtain water consumption forecasts with an error of only 1.31%. According to predictive model quality standards, a MAPE error below 10% should be regarded as a determinant of highly accurate forecasting⁵⁸. Although it is not possible to directly compare the models created by other authors due to the different conditions and the scope of the conducted research, there are a number of results in which the MAPE values of forecasts range from 2.0 to 3.0%^17,37,50 or they were containing larger error^49,57,59. To make the obtained results more reliable, water consumption forecasts were intentionally made using standard methods (LR, SVR, MLP, RF, CART) on analogous data sets, which was described in the previous chapter. The implementation of evolutionary procedures allows to create short-term models of water consumption forecasts that are much more reliable, although the final value of the MAPE error is influenced by the measurement period and the range in which weekly periodicity was extracted from the time series.

The results and diagrams for all ten datasets are included in the attachments, which constitute an appendix to the article (Appendix 2). The maximum likelihood model with a testing error of 1.31% was obtained for dataset no. 3 (Figs. 3. and 4). The testing process (30 successive days) ended with the greatest error in the case of September 24, 2016, when water consumption in the selected zone was almost 2.5 times higher than normal. It was caused by additional supply to the neighboring zone via the analyzed DMA. Similarly, on September 29, 2016, and October 3, 2016, the error value was significant due to a failure of the measuring device. All above-mentioned situations do not diminish the quality of the forecasting model due to the fact that they are random events.

The research results indicate that the worst accuracy of testing was obtained for dataset no. 7 from 2018-07-21 to 2018-08-20 (Figs. 5 and 6). The increase in the forecast error was caused by the record-breaking hot period, which resulted in increased consumption of tap water. This is another proof that weather conditions affect the forecasts of water consumption and its anomalies significantly affect the deterioration of the prediction.

As defined in “Evolutionary approach” section, the weekly periodicity vector is discovered by an evolutionary algorithm as a solution to an optimization problem with the objective function that evaluates the accuracy of the candidate weekly periodicity vector on the training dataset. Figure 7 shows the distribution of weekly periodicity vector in water consumption. The evolutionary approach makes it possible to formulate some rules for water consumption by network users. It can be noticed that the greatest variability in water consumption is visible on Saturdays, while the lowest on Wednesdays. The results from end of the working week (Thursday–Friday) indicate that users of the water supply network are probably postponing household chores until Saturday. The water consumption habits of the inhabitants of DMA zones constitute a fairly large part of the research conducted by other scientists²² and the evolutionary approach may indirectly contribute to them and facilitate the creation of water demand patterns^28,60.

Figure 8 presents the objective function in the successive iterations of the evolutionary algorithm. It is easy to see that the evolutionary algorithm is capable of solving the optimization problem and discover a quasi-solution after about 5 iterations. A good learning algorithm should minimize the number of calculations required to achieve a good accuracy of the model⁶¹. For the learning process that models water demand, which is shaped by so many random factors, this is not a high number of iterations, which demonstrates that the proposed predicting methodology is effective.

Figure 9 shows the evolution (changes in successive iterations of learning) of the weekly seasonality vector. In the chart, they are responsible for the consecutive days of the week. In the first iteration, the vector is random (not learned), while in the last iteration, it is the final vector. In intermediate iterations, it is noticeable how the evolution algorithm adjusts the vector values. For the first 10 or so iterations, the changes are quite large, then small, which means that the final vector was found quite quickly, later it was only slightly adjusted. There is an increased consumption of water on Saturdays, so the weight of the average water demand on a given weekday in relation to the weekly average is higher for Saturday.

Conclusions and future work

Modern management of water supply infrastructure widely uses the development of information technologies that enable to enhance the processes connected with its operation. Time series of basic hydraulic parameters support increasingly advanced analytical and diagnostic tools. Water consumption prediction in various time horizons enables, among others, planning and performing maintenance and renovation works effectively, optimizing the pump station operations and detecting failures.

The research involved the implementation of a classical evolutionary approach for improving short-term prediction of water demand in one of the Wroclaw DMA zones with multifamily buildings. Additionally, an evolutionary strategy, which had not been used previously in this field of studies, was employed to determine the influence of weekly seasonality on water consumption distribution. The studies demonstrated the advantage of evolutionary strategies in combination with LR over standard methods of forecasting (SVR, MLP, RF, CART). The solution proposed by the authors enables to predict the volume of water consumption, expressed as the Mean Absolute Percentage Error (MAPE) more efficiently. The obtained final average values of MAPE errors were 2.12% (for the best 1.31% and worst 3.31% variant of forecasting, respectively). The research also demonstrated that evolutionary strategies are useful for data preprocessing and improving the knowledge of the anthropogenic behavior of water supply network users. Moreover, the results of the research indicate that the inclusion of additional variables in the input data in the form of meteorological parameters and water consumption for the preceding two weeks improves the accuracy of prediction.

In further studies, the authors intend to tackle the issues of predicting water demand in other DMA zones managed by the Municipal Water and Sewage Company, but of different consumption types, e.g., industrial consumption. An important element of the planned studies will be the spatial correlation of observations obtained from monitoring individual water supply areas.

Data availability

For data sources, see the Acknowledgments section; on analyses in this manuscript, please contact: justyna.stanczyk@upwr.edu.pl.

References

Haddeland, I. et al. Global water resources affected by human interventions and climate change. PNAS 111, 3251–3256. https://doi.org/10.1073/pnas.1222475110 (2014).
Article ADS CAS PubMed Google Scholar
Hussain, Z. et al. A comparative appraisal of classical and holistic water scarcity indicators. Water Resour. Manag. 36, 931–950. https://doi.org/10.1007/s11269-022-03061-z (2022).
Article Google Scholar
Rinaudo, J.-D. Long-term water demand forecasting. Understanding and managing urban water in transition. 239–268 (2015).
Shirkoohi, M. G., Doghri, M. & Duchesne, S. Short-term water demand predictions coupling an artificial neural network model and a genetic algorithm. Water Supply 21, 2374–2386. https://doi.org/10.2166/ws.2021.049 (2021).
Article Google Scholar
Candelieri, A. et al. Tuning hyperparameters of a SVM-based water demand forecasting system through parallel global optimization. Comput. Oper. Res. 106, 202–209. https://doi.org/10.1016/j.cor.2018.01.013 (2019).
Article MathSciNet Google Scholar
Brentan, B. M., Luvizotto, E. Jr., Herrera, M., Izquierdo, J. & Pérez-García, R. Hybrid regression model for near real-time urban water demand forecasting. J. Comput. Appl. Math. 309, 532–541. https://doi.org/10.1016/j.cam.2016.02.009 (2017).
Article MathSciNet MATH Google Scholar
Tiwari, M. K. & Adamowski, J. F. Medium-term urban water demand forecasting with limited data using an ensemble wavelet–bootstrap machine-learning approach. J. Water Resour. Plan. Manag. 141, 04014053. https://doi.org/10.1061/(ASCE)WR.1943-5452.0000454 (2015).
Article Google Scholar
Candelieri, A., Soldi, D. & Archetti, F. Layered machine learning for short-term water demand forecasting. Eng. Manag. J. 14, 2061–2072. https://doi.org/10.30638/eemj.2015.221 (2015).
Article Google Scholar
Candelieri, A., Soldi, D. & Archetti, F. Short-term forecasting of hourly water consumption by using automatic metering readers data. Procedia Eng. 119, 844–853. https://doi.org/10.1016/j.proeng.2015.08.948 (2015).
Article Google Scholar
Ghiassi, M., Fa’al, F. & Abrishamchi, A. Large metropolitan water demand forecasting using DAN2, FTDNN, and KNN models: A case study of the city of Tehran, Iran. Urban Water J. 14, 655–659. https://doi.org/10.1080/1573062X.2016.1223858 (2017).
Article Google Scholar
Hemati, A., Rippy, M. A., Grant, S. B., Davis, K. & Feldman, D. Deconstructing demand: The anthropogenic and climatic drivers of urban water consumption. Environ. Sci. Technol. 50, 12557–12566. https://doi.org/10.1021/acs.est.6b02938 (2016).
Article ADS CAS PubMed Google Scholar
Guyennon, N., Romano, E. & Portoghese, I. Long-term climate sensitivity of an integrated water supply system: The role of irrigation. Sci. Total Environ. 565, 68–81. https://doi.org/10.1016/j.scitotenv.2016.04.157 (2016).
Article ADS CAS PubMed Google Scholar
Berglund, E. Z. et al. State-of-the-art review: smart infrastructure: A vision for the role of the civil engineering profession in smart cities. J. Infrastruct. Syst. 26(2), 03120001. https://doi.org/10.1061/(ASCE)IS.1943-555X.0000549 (2020).
Article Google Scholar
Dawood, T., Elwakil, E., Novoa, H. M. & Delgado, J. F. G. Ensemble intelligent systems for predicting water network condition index. Sustain. Cities Soc. 73, 103104. https://doi.org/10.1016/j.scs.2021.103104 (2021).
Article Google Scholar
Novak, J. et al. Integrating behavioural change and gamified incentive modelling for stimulating water saving. Environ. Model. Softw. 102, 120–137. https://doi.org/10.1016/j.envsoft.2017.11.038 (2018).
Article Google Scholar
Tang, K., Parsons, D. J. & Jude, S. Comparison of automatic and guided learning for Bayesian networks to analyse pipe failures in the water distribution system. Reliab. Eng. Syst. Saf. 186, 24–36. https://doi.org/10.1016/j.ress.2019.02.001 (2019).
Article Google Scholar
Bakker, M., Van Duist, H., Van Schagen, K., Vreeburg, J. & Rietveld, L. Improving the performance of water demand forecasting models by using weather input. Procedia Eng. 70, 93–102. https://doi.org/10.1016/j.proeng.2014.02.012 (2014).
Article Google Scholar
Sebri, M. Forecasting urban water demand: A meta-regression analysis. J. Environ. Manag. 183, 777–785. https://doi.org/10.1016/j.jenvman.2016.09.032 (2016).
Article Google Scholar
Eliades, D. G. & Polycarpou, M. M. Leakage fault detection in district metered areas of water distribution systems. J. Hydroinformatics 14, 992–1005. https://doi.org/10.2166/hydro.2012.109 (2012).
Article Google Scholar
Ambrosio, J. K. et al. Committee machines for hourly water demand forecasting in water supply systems. Math. Probl. Eng. https://doi.org/10.1155/2019/9765468 (2019).
Article Google Scholar
Pacchin, E., Gagliardi, F., Alvisi, S. & Franchini, M. A comparison of short-term water demand forecasting models. Water Resour. Manag. 33, 1481–1497. https://doi.org/10.1007/s11269-019-02213-y (2019).
Article Google Scholar
Vieira, P., Jorge, C. & Covas, D. Assessment of household water use efficiency using performance indices. Resour. Conserv. Recycl. 116, 94–106. https://doi.org/10.1016/j.resconrec.2016.09.007 (2017).
Article Google Scholar
Zangenehmadar, Z. & Moselhi, O. Prioritizing deterioration factors of water pipelines using Delphi method. Measurement 90, 491–499. https://doi.org/10.1016/j.measurement.2016.05.001 (2016).
Article ADS Google Scholar
Arsene, C. T. & Gabrys, B. Mixed simulation-state estimation of water distribution systems based on a least squares loop flows state estimator. Appl. Math. Model. 38, 599–619. https://doi.org/10.1016/j.apm.2013.06.012 (2014).
Article MathSciNet MATH Google Scholar
Tavakoli, A. & Rahimpour, M. Gröbner bases for solving ΔQ-equations in water distribution networks. Appl. Math. Model. 38, 562–575. https://doi.org/10.1016/j.apm.2013.06.022 (2014).
Article MathSciNet MATH Google Scholar
Zubaidi, S. L. et al. A novel methodology for prediction urban water demand by wavelet denoising and adaptive neuro-fuzzy inference system approach. Water 12, 1628. https://doi.org/10.3390/w12061628 (2020).
Article Google Scholar
Alvisi, S. & Franchini, M. Assessment of the predictive uncertainty within the framework of water demand forecasting by using the model conditional processor. Procedia Eng. 89, 893–900. https://doi.org/10.1016/j.proeng.2014.11.522 (2014).
Article Google Scholar
Candelieri, A. & Archetti, F. Identifying typical urban water demand patterns for a reliable short-term forecasting–the icewater project approach. Procedia Eng. 89, 1004–1012. https://doi.org/10.1016/j.proeng.2014.11.218 (2014).
Article Google Scholar
Chen, J. & Boccelli, D. Demand forecasting for water distribution systems. Procedia Eng. 70, 339–342. https://doi.org/10.1016/j.proeng.2014.02.038 (2014).
Article Google Scholar
Kofinas, D., Mellios, N., Papageorgiou, E. & Laspidou, C. Urban water demand forecasting for the island of Skiathos. Procedia Eng. 89, 1023–1030. https://doi.org/10.1016/j.proeng.2014.11.220 (2014).
Article Google Scholar
Romano, M. & Kapelan, Z. Adaptive water demand forecasting for near real-time management of smart water distribution systems. Environ. Model. Soft. 60, 265–276. https://doi.org/10.1016/j.envsoft.2014.06.016 (2014).
Article Google Scholar
Tiwari, M., Adamowski, J. & Adamowski, K. Water demand forecasting using extreme learning machines. J. Water Land Dev. https://doi.org/10.1515/jwld-2016-0004 (2016).
Article Google Scholar
Ernesto, A., Amadou, Ba., Bradley, E. & Sean, McKenna. Tailoring seasonal time series models to forecast short-term water demand. J. Water Resour. Plan. Manag. 142, 04015067. https://doi.org/10.1061/(asce)wr.1943-5452.0000591 (2016).
Article Google Scholar
Duerr, I. et al. Forecasting urban household water demand with statistical and machine learning methods using large space-time data: A comparative study. Environ. Model. Softw. 102, 29–38. https://doi.org/10.1016/j.envsoft.2018.01.002 (2018).
Article Google Scholar
Kozłowski, E., Kowalska, B., Kowalski, D. & Mazurkiewicz, D. Water demand forecasting by trend and harmonic analysis. Arch. Civ. Mech. Eng. 18, 140–148. https://doi.org/10.1016/j.acme.2017.05.006 (2018).
Article Google Scholar
Xenochristou, M., Kapelan, Z., Hutton, C. & Hofman, J. Smart water demand forecasting: Learning from the data. EPiC Ser. Eng. 3, 2351–2358. https://doi.org/10.29007/wkp4 (2018).
Article Google Scholar
Xu, Y., Zhang, J., Long, Z., Tang, H. & Zhang, X. Hourly urban water demand forecasting using the continuous deep belief echo state network. Water 11, 351. https://doi.org/10.3390/w11020351 (2019).
Article Google Scholar
Guo, W., Liu, T., Dai, F. & Xu, P. An improved whale optimization algorithm for forecasting water resources demand. Appl. Soft Comput. 86, 105925. https://doi.org/10.1016/j.asoc.2019.105925 (2020).
Article Google Scholar
Karamaziotis, P. I., Raptis, A., Nikolopoulos, K., Litsiou, K. & Assimakopoulos, V. An empirical investigation of water consumption forecasting methods. Int. J. Forecast. 36, 588–606. https://doi.org/10.1016/j.ijforecast.2019.07.009 (2020).
Article Google Scholar
Smolak, K. et al. Applying human mobility and water consumption data for short-term water demand forecasting using classical and machine learning models. Urban Water J. 17, 32–42. https://doi.org/10.1080/1573062X.2020.1734947 (2020).
Article Google Scholar
Bata, M., Carriveau, R. & Ting, D.S.-K. Short-term water demand forecasting using hybrid supervised and unsupervised machine learning model. Smart Water 5, 2. https://doi.org/10.1186/s40713-020-00020-y (2020).
Article Google Scholar
Bata, M., Carriveau, R. & Ting, D.S.-K. Short-term water demand forecasting using nonlinear autoregressive artificial neural networks. J. Water Resour. Plan. Manag. 146, 04020008. https://doi.org/10.1061/(asce)wr.1943-5452.0001165 (2020).
Article Google Scholar
Okeya, I., Kapelan, Z., Hutton, C. & Naga, D. Online burst detection in a water distribution system using the Kalman filter and hydraulic modelling. Procedia Eng. 89, 418–427. https://doi.org/10.1016/j.proeng.2014.11.207 (2014).
Article Google Scholar
Wu, Y., Liu, S., Wu, X., Liu, Y. & Guan, Y. Burst detection in district metering areas using a data driven clustering algorithm. Water Res. 100, 28–37. https://doi.org/10.1016/j.watres.2016.05.016 (2016).
Article CAS PubMed Google Scholar
Bakker, M., Vreeburg, J., Van Schagen, K. & Rietveld, L. A fully adaptive forecasting model for short-term drinking water demand. Environ. Model. Softw. 48, 141–151. https://doi.org/10.1016/j.envsoft.2013.06.012 (2013).
Article Google Scholar
Zhou, S. L., McMahon, T. A., Walton, A. & Lewis, J. Forecasting operational demand for an urban water supply zone. J. Hydrol. 259, 189–202. https://doi.org/10.1016/S00221694(01)00582-0 (2002).
Article ADS Google Scholar
Statistical Office in Wrocław, 2017. Wrocław in Figures. http://wroclaw.stat.gov.pl/publikacje-i-foldery/foldery/wroclaw-w-liczbach-2017-folder,1,4.html (Accessed 29 Sept 2017).
Maruyama, Y. & Yamamoto, H. A study of statistical forecasting method concerning water demand. Procedia Manuf. 39, 1801–1808. https://doi.org/10.1016/j.promfg.2020.01.259 (2019).
Article Google Scholar
Velasco, L., Granados, A., Ortega, J. & Pagtalunan, K. Medium-term water consumption forecasting using artificial neural networks. Presented at the 17th Conference of the Science Council of Asia, National Research Council of the Philippines (2017)
Piasecki, A., Jurasz, J. & Kaźmierczak, B. Forecasting daily water consumption: A case study in Torun, Poland. Period. Polytech.-Civ. 62, 818–824. https://doi.org/10.3311/PPci.11930 (2018).
Article Google Scholar
Pesantez, J. E., Berglund, E. Z. & Kaza, N. Smart meters data for modeling and forecasting water demand at the user-level. Environ. Model. Softw. 125, 104633. https://doi.org/10.1016/j.envsoft.2020.104633 (2020).
Article Google Scholar
Murphy, K. P. Machine learning: a probabilistic perspective (MIT Press, 2012).
MATH Google Scholar
Goodfellow, I., Bengio, Y. & Courville, A. Deep Learning (Adaptive Computation and Machine Learning Series). 321–359 (2017)
Scikit-learn, machine learning in Python. https://scikit-learn.org.
Moreno, J. J. M., Pol, A. P., Abad, A. S. & Blasco, B. C. Using the R-MAPE index as a resistant measure of forecast accuracy. Psicothema 25, 500–506. https://doi.org/10.7334/psicothema2013.23 (2013).
Article Google Scholar
Kramer, O. Machine Learning for Evolution Strategies (Springer, 2016). https://doi.org/10.1007/978-3-319-33383-0.
Book MATH Google Scholar
Antunes, A., Andrade-Campos, A., Sardinha-Lourenço, A. & Oliveira, M. Short-term water demand forecasting using machine learning techniques. J. Hydroinformatics. 20, 1343–1366. https://doi.org/10.2166/hydro.2018.163 (2018).
Article Google Scholar
Lewis, C. D. Industrial and Business Forecasting Methods: A Practical Guide to Exponential Smoothing and Curve Fitting (Butterworth-Heinemann, 1982).
Google Scholar
Benítez, R. et al. A short-term data based water consumption prediction approach. Energies 12, 2359. https://doi.org/10.3390/en12122359 (2019).
Article Google Scholar
Alvisi, S., Franchini, M. & Marinelli, A. A short-term, pattern-based model for water-demand forecasting. J. Hydroinformatics. 9, 39–50. https://doi.org/10.2166/hydro.2006.016 (2007).
Article Google Scholar
Joachims, T. Making large-scale SVM learning practical. Technical report (1998) https://www.econstor.eu/handle/10419/77178.

Download references

Acknowledgements

Authors would like to thank the employees of the New Technologies Centre, Municipal Water and Sewage Company S.A. in Wroclaw, for cooperation and sharing the data used in this research. The source of the meteorological data is the Institute of Meteorology and Water Management—National Research Institute (PIB). Data from the Institute of Meteorology and Water Management—National Research Institute have been processed.

Funding

This work was supported by the Wrocław University of Environmental and Life Sciences (Poland) as the Ph.D. research program "Innowacyjny Naukowiec, no. N060/0014/21”. Calculations have been carried out using resources provided by Wroclaw Centre for Networking and Supercomputing (http://wcss.pl), Grant No. 405. Computational intelligence research in this work was supported by the Polish National Science Centre (NCN) under grant OPUS-18 no. 2019/35/B/ST6/04379.

Author information

Authors and Affiliations

Institute of Environmental Engineering, Wroclaw University of Environmental and Life Sciences, 24 Grunwaldzki Square, 50-363, Wrocław, Poland
Justyna Stańczyk & Joanna Kajewska-Szkudlarek
Institute of Computer Science, University of Wroclaw, 15 Joliot-Curie Street, 50-383, Wrocław, Poland
Piotr Lipiński & Paweł Rychlikowski

Authors

Justyna Stańczyk
View author publications
You can also search for this author in PubMed Google Scholar
Joanna Kajewska-Szkudlarek
View author publications
You can also search for this author in PubMed Google Scholar
Piotr Lipiński
View author publications
You can also search for this author in PubMed Google Scholar
Paweł Rychlikowski
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.S. and J.K-S. prepared the state of the art, defined the problem, and designed the research framework; P.L. designed the prediction scheme and the evolutionary improvement, P.R. designed the evaluation and comparison scheme, P.L. and P.R. performed the computational experiments, J.S. studied and interpreted the results; and J.S., J.K-S., P.L. and P.R. wrote the paper.

Corresponding author

Correspondence to Joanna Kajewska-Szkudlarek.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix 1: The results of the implementation of the references methods (MAPE)

Methods	Average	Set 1	Set 2	Set 3	Set 4	Set 5	Set 6	Set 7	Set 8	Set 9	Set 10
Linear regression	0.02197	0.01628	0.02329	0.01408	0.02201	0.01896	0.02024	0.03414	0.01675	0.02895	0.02500
SVR-linear	0.02449	0.01430	0.02184	0.02063	0.02806	0.02153	0.02240	0.03502	0.01996	0.02991	0.03126
RF-5	0.02408	0.02626	0.02204	0.01868	0.02340	0.02030	0.02493	0.03134	0.01701	0.03116	0.02570
RF-10	0.02319	0.02359	0.02211	0.01780	0.02551	0.01870	0.02254	0.03035	0.01663	0.02939	0.02533
CART-Tree	0.03356	0.03937	0.02674	0.03355	0.03578	0.02990	0.02995	0.03084	0.02347	0.05135	0.03466
MLP20-15	0.03508	0.04454	0.03069	0.02690	0.03521	0.03822	0.02439	0.03599	0.04117	0.03774	0.03592
MLP30-10	0.03162	0.02257	0.02337	0.03470	0.02322	0.02532	0.03753	0.03745	0.02285	0.04982	0.03933
MLP15-10	0.02969	0.02444	0.02445	0.02669	0.02540	0.02656	0.03170	0.04073	0.03646	0.02603	0.03443

Appendix 2: The results of the implementation of the evolutionary approach

Test datasets

Dataset no. 1

Dataset no. 2

Dataset no. 3

Dataset no. 4

Dataset no. 5

Dataset no. 6

Dataset no. 7

Dataset no. 8

Dataset no. 9

Dataset no. 10.

Training datasets

Dataset no. 1

Dataset no. 2

Dataset no. 3

Dataset no. 4

Dataset no. 5

Dataset no. 6

Dataset no. 7

Dataset no. 8

Dataset no. 9

Dataset no. 10

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Stańczyk, J., Kajewska-Szkudlarek, J., Lipiński, P. et al. Improving short-term water demand forecasting using evolutionary algorithms. Sci Rep 12, 13522 (2022). https://doi.org/10.1038/s41598-022-17177-0

Download citation

Received: 15 September 2021
Accepted: 21 July 2022
Published: 08 August 2022
DOI: https://doi.org/10.1038/s41598-022-17177-0

This article is cited by

Rewards, risks and responsible deployment of artificial intelligence in water systems
- Catherine E. Richards
- Asaf Tzachor
- Richard Fenner
Nature Water (2023)
Fermatean hesitant fuzzy rough aggregation operators and their applications in multiple criteria group decision-making
- Attaullah
- Noor Rehman
- Gustavo Santos-García
Scientific Reports (2023)
The impact of the number of high temporal resolution water meters on the determinism of water consumption in a district metered area
- Justyna Stańczyk
- Krzysztof Pałczyński
- Paweł Licznar
Scientific Reports (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Machine-learning algorithms for forecast-informed reservoir operation (FIRO) to reduce flood damages

The impact of the number of high temporal resolution water meters on the determinism of water consumption in a district metered area

Short-term streamflow modeling using data-intelligence evolutionary machine learning models

Introduction

Literature review of water demand forecasting methods

Research problem definition

Materials and methods

Study area and datasets preparation

Reference methods

Support vector regression

A multilayer perceptron

Linear regression

Classification and regression trees (CART)

Random Forests

Evaluating forecast accuracy

Evolutionary approach

Consent for publication

Results and discussion

Reference methods

Evolutionary approach

Conclusions and future work

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Appendices

Appendices

Appendix 1: The results of the implementation of the references methods (MAPE)

Appendix 2: The results of the implementation of the evolutionary approach

Test datasets

Training datasets

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Rewards, risks and responsible deployment of artificial intelligence in water systems

Fermatean hesitant fuzzy rough aggregation operators and their applications in multiple criteria group decision-making

The impact of the number of high temporal resolution water meters on the determinism of water consumption in a district metered area

Comments

Search

Quick links