Application of analysis of variance to determine important features of signals for diagnostic classifiers of displacement pumps

Konieczny, Jarosław; Łatas, Waldemar; Stojek, Jerzy

doi:10.1038/s41598-024-56498-0

Download PDF

Article
Open access
Published: 13 March 2024

Application of analysis of variance to determine important features of signals for diagnostic classifiers of displacement pumps

Scientific Reports volume 14, Article number: 6098 (2024) Cite this article

220 Accesses
Metrics details

Subjects

Abstract

This paper presents the use of one-way analysis of variance ANOVA as an effective tool for ranking the features calculated from diagnostic signals and evaluates their impact on the accuracy of the machine learning system's classification of displacement pump wear.The first part includes a review of contemporary diagnostic systems and a description of typical damage of multi-piston displacement pumps and Its causes. The work also contains description of a diagnostic experiment which was conducted in order to obtain the matrix of vibration signals and the matrix of pressures measured at selected locations on the pump housing and at the pump pressure line. The measured signals were subjected to time–frequency analysis. The features of signals calculated in the time and frequency domains were ranked using the ANOVA. The next step involved the use the available classifiers in pump wear evaluation, conducting tests and assessing their effectiveness in terms of the ranking of features and the origin of diagnostic signals.

Using energy time–frequency of Hilbert Huang transform to analyze the performance of the variable valve timing engine

Article Open access 11 February 2022

A degradation feature extraction technique based on static divided symbol sequence entropy

Article Open access 03 January 2024

Displacement of mining vibrating screen obtained from acceleration based on improved S–G filter

Article Open access 07 February 2024

Introduction

Hydraulic drive and control have been used in many fields of technology. This was determined by their many favourable properties, such as: high output power density compared to the dimensions and weight of such systems, ease of automation, negligible moments of inertia of pumps and hydraulic motors, and, in most cases, their self-lubrication. The hydrostatic drive is commonly used in mobile machines, i.e., construction and agricultural machines. Power hydraulics have found application in heavy industry in press systems, rolling mills, pressure die casting machines and mining. Maintaining the operability of hydraulic systems requires keeping a given class of cleanliness of the working fluid and monitoring the physical parameters of their main components (including positive displacement pumps). This is the main task of maintenance engineering. The development of damage in a hydraulic element is usually accompanied by an increase in the temperature of the working fluid in the control signal lines and an increase in pressure pulsation and noise level. Monitoring of machines containing hydraulic systems, which is well-planned and carried out, allows for earlier determination of the working state of system components, thus facilitating the possibility of carrying out planned service works and preventing failures and resulting downtimes. Since hydrostatic drives started to be used in machines and devices, attempts have been made to predict and assess the efficiency of their components with the use of various methods¹. Examples of their implementation in the diagnosis of hydraulic components are presented below, with particular emphasis on techniques using the so-called intelligent fault identification.

The first, basic and well described in the literature tool used in machine diagnostics is the use of time and time–frequency analysis of signals measured in characteristic places of the tested system (or checked element). Such a solution was used at least in works^2,3,4 where the development of abrasive wear of the cam disk in a multi-piston positive displacement pump was monitored. Another example may be the use of the Wavelet transform in the diagnosis of the external tightness of the hydraulic cylinder, which is described in the article⁵.

Another method of diagnosing wear of hydraulic elements (or systems) is related to the construction of a model of the diagnosed element, in which the Kalman filter⁶ or the extended Kalman filter^7,8 is often used as a state observer. Such a solution was described in⁸ as an effective tool for detecting leaks in the impeller of a multi-piston pump and in⁷ for examining changes in the efficiency of a screw pump. The algorithm of the adaptive Kalman filter was used in works^9,10 to monitor the efficiency of the hydraulic manipulator and to detect the internal leakage of the hydraulic cylinder.

The next (third) group of systems used to diagnose hydraulic systems are systems with elements of the so-called intelligent damage identification based on the use of machine learning and deep learning algorithms^11,12,13. In the literature, there are many papers describing the use of such solutions in the diagnostics of hydraulic systems^14,15,16. For example, the possibility of using machine learning in the diagnostics of pumps in liquefied gas regasification installations is presented in¹⁷. In this case, the condition of the pump was tested based on the prediction of its power demand (power consumption). A comparison of the applied diagnostic models was made using a developed error analysis (minimum relative Error, absolute Error and mean square Error). The best results (the best accuracy) were recorded by the Gradient Boosted Trees GBT model. The following article¹⁸ presents a combination of the empirical wavelet analysis with the Principal Component Analysis PCA and extreme machine learning, which was used in the diagnostics of a hydraulic piston pump. First, the vibration signals measured in characteristic places of the pump body were decomposed into components of different frequencies using the empirical wavelet analysis. For the signals obtained in this way, features were calculated and then reduced in terms of significance using PCA. The input vector reduced to the most important features was fed to the classifier using an extreme machine learning algorithm. The accuracy of predicting the modelled damage by the built learning system was 100%.

The diagnosis of abrasive wear of the piston foot in an axial piston pump using the extreme machine learning algorithm is presented in the article¹⁹. Three methods of obtaining features constituting input data to the selected classification models were proposed, i.e. Wavelet Packet Transform (WPT), Empirical Mode Decomposition (EMD) and Local, Mean Decomposition (LMD). The obtained feature vectors were then ranked and the most significant ones were then given to the inputs of the classifiers. Finally, three pump condition prediction models were compared, i.e. Extreme Machine Learning ELM, Back Propagation BP and Support Vector Machine SVM.

The use of the K-nearest Neighbors classifier for effective classification of the life state of a multi-piston pump was described in²⁰. In turn, the application of the SVM algorithm to monitor the condition of a hydraulic car brake is described in²¹.

Article²² presents the use of deep machine learning in the diagnostics of an axial piston pump. Initially, time–frequency portraits were obtained using a continuous Wavelet transform of vibration signals as input data for the Convolutional Neural Networks CNN, then the ranges of the adopted hyperparameters of the network were optimized using the Bayesian optimiser. By integrating deep machine learning and the adaptive Bayesian algorithm, a modified model was obtained for diagnosing failures in pumps.

Article²³ presents the use of a deep Multi-Signal Fusion Adversarial Model Based Transfer Learning MFAN in the diagnostics of an axial displacement pump operating under variable operating conditions, i.e. variable capacity and pressure (as is the case in most drives of construction and agricultural machines). A modular structure of the system was proposed by first carrying out fusion of vibration and acoustic signals with assigning their weights, with a module for generating features and the adopted structure of the neural network. The average accuracy of MFAN in identifying damage reached 98.5%.

The use of deep machine learning to classify faults in axial piston pumps is also presented in articles^24,25. Applying the deep belief networks architecture in this case, a high accuracy of classification of the four most common axial piston pump faults was achieved. The classification accuracy was above 97%. Device monitoring with a discussion of Deep Belief Network (DBN), Convolutional Neural Networks (CNN) and Recurrent Neural Networks (RNN) is presented in²⁶.

Many papers on the diagnostics of displacement pump failures are based on using previously prepared models of pump component damage. Such an approach does not give the complete picture of the damage and is only its approximation. The authors have applied a different approach, which involves obtaining the pump component wear in a natural way based on a long-term pump operation under an assumed load at lowered oil purity class. In addition to obtaining the image of pump component wear in a natural way, this method allows controlling and observing the damage development and accompanying symptoms.

The review of papers on the diagnostics of hydrostatic systems indicates that most authors evaluating the efficiency of such systems use signals measured in stationary operating conditions (after the system reached thermal stability). The operational practice however proves that information included in signals measured in the non-stationary pump operation (at varying viscosity of the operational liquid) gives a fuller picture of the component wear. To evaluate the pump component wear, the authors have used signals measured in a steady state and in dynamic transient states.

In this paper, the authors have presented the application of machine learning²⁷ in the classification of wear of a multi-piston displacement pump. Machine learning systems have many advantages in engineering applications, such as the possibility of building a system of a very good classification accuracy with the use of a reasonable amount of data and short learning times of the developed diagnostic models. From the beginning, the assumption was that the learning system would be based on measures (features) obtained from vibration signals measured at characteristic locations on the pump housing, and additional signals measured on the pump pressure port by static and dynamic pressure transducers which would be obtained during a passive diagnostic experiment. The Choice of measures which were calculated from the obtained signals and the subsequent ranking of their significance was carried out using the analysis of variance (ANOVA)^28,29.

The originality of the approach involves:

possibility of using ANOVA for an effective ranking of diagnostic features and evaluation of the impact of the origin of diagnostic signals on the accuracy of classification of displacement pump wear by the machine learning system;
designing and experimenting with a way that allows obtaining the pump component wear in a natural way based on many hours of pump operation;
using the signals measured in the entire pump operation range (i.e. in the stationary and non-stationary operating conditions) to evaluate the degree of pump wear.

After the introduction, the rest of this article follows. In Section "Tested object", the most common failures of multipiston pumps are presented, as well as an example of time–frequency analysis of the vibration signals produced when the wear of the pump's valve plate is depressed. Section "Experimental part" describes the conduct of a test experiment at a laboratory station. Section "Selection of the classification system features" presents the features that were calculated from the measured diagnostic signals. Then, using ANOVA analysis of variance, the obtained features were subjected to ranking of their significance, which is described in Section "Application of ANOVA in selection of significant features of measured signals". After discussing in Section "Selection of classification algorithm" the classification algorithms used, the results obtained from the classification of the wear condition of the pump tested by the learning system are presented in Section "Pump wear classification". The article is concluded with a summary.

Tested object

The tested object was a multi-piston axial pump^30,31 with a swashplate whose detailed structural diagram is presented in²⁰.

As presented in²⁰ and^32,33, the most frequent type of wear in displacement pumps is abrasive wear. Excessive load on the rotor unit among others leads to abrasive wear of its elements and increasing radial clearance within piston-cylinder pairs. It results in increasing volumetric loss and reduced general efficiency of the pump.

The wear of the swashplate (Fig. 1) which mates with surfaces of rotor piston shoes, leads to occurrence of elliptical notch (Fig. 1) on its surface and as a consequence to a total wear of this surface. It causes reduction of pump mechanical and hydraulic efficiency. On the other hand, the wear of the valve plate is among other things caused by the decay of the lubrication layer between the disc surface and a surface of the rotor face. It results in occurrence of flow micro-conduits (Fig. 2) on the surface of the plate bridge. Such conduits cause flow of the working medium between suction and pressure zones in the pump, and consequently loss of tightness, reduction of operational pressure and volumetric efficiency of the pump.

The developing wear of the pump components can be traced with the use of conventional diagnostic methods which are based on time and time–frequency analysis of the measured signals. When damage of a pump part is initiated, the measured signal (e.g. vibration signal) starts to include synchronous components, as well as a number of quasi-periodic and chaotic components. We can use traditional techniques described in³ to analyze the measured signal and assume that the transition function is linear. When we assume that the transition function is nonlinear, we can use the methods based on deterministic chaos³⁴. An example of time–frequency analysis from vibration acceleration signals measured on the pump body (in three directions X, Y, Z) in the development of the swashplate damage which involved gradual deepening of the elliptical notch, is shown in the figures below.

The analysis of the magnitude squared of the short-time Fourier transform STFT images (called spectrograms) obtained in axes X and Y (Figs. 3 and 4) indicates that as the swashplate wear increases. The signal spectrum gradually moves towards lower frequencies and its amplitude increases. For the measurements obtained in the X direction, the range of frequency changes of the signal spectra ranges from 4 to 0.02 kHz, and for the signals measured in the Y direction, it is wider, ranging from 10 to 0.02 kHz. In the case of vibration signals from axis Z (Fig. 5), there is no clear shift towards lower or higher frequencies, but there is a tendency to reduce the signal’s amplitude as the wear develops. In this direction of measurement (Z direction), the frequency variation range of the signal spectra was 8 to 1 kHz.

Experimental part

The multi-piston components' wear tests were conducted on a purpose-built laboratory station. One of the main test objectives was to obtain the wear of the pump components in a natural way, hence the multi-hour tests were done under actual operating conditions of the pump. During the test, the diagnostic signals from the installed measuring transducers were measured:

static pressure;
dynamic pressure;
vibration acceleration on the pump body in three axes (X, Y, Z).

The acceleration was measured on the pump body in the vicinity of the valve plate and the swashplate. The static and dynamic pressures were measured on the pressure line directly on the outlet port. The sampling frequency of the measured signals was 50 kHz. A simplified diagram of the measuring setup and location of the measuring transducers is presented in Fig. 6.

During the signal measurement, the pump shaft revolution signal was also measured, and each recorded signal was divided by it. In the case of the pump running at nominal speed (n = 1500 rpm), we obtained 25 data series per second of measurement and tables of signals one-revolution long (that is 0.04 s) which corresponded to 2010 measurement points. The 1-s signals were recorded at 15-min intervals between successive measurements. The daily duration of the experiment was 10 h of pump operation under a static load.

The pump operation was monitored by measuring the pressure change at its outlet. By comparing day-by-day pressure values (for individual measurements), it was assumed that in the case when the pressure drop on the pump outlet reaches 10 percent of the load pressure (which was 7 MPa), the pump remains in good working order (classifier: pump operable). When the pressure drop on the outlet further increases (up to 20% of the initial pressure), the pump is classified as close to its end of life (classifier: end of life). When the pressure drop on the pump outlet exceeds 20 percent, the pump is classified as worn (classifier: pump worn). In total, recorded were 441 measured signals (pump body vibration, static and dynamic pressures), of which 294 (evenly 147) were signals for pump in good working order (classifier: pump operable) and pump in a transition state (classifier: end of life). The last 147 signals were from the worn pump (classifier: pump worn). The next stage in preparation of data for the pump condition classification system involved dividing the data into those used for the system learning and the data used for subsequent verification and validation of the system. It was assumed that 30% of the total data would be used to validate and test the obtained classifier, and the remainder (70%) of the real data would be used in the learning process. The prepared data were entered into the Matlab³⁵ package, where further analysis occurred. The analysis involved:

selection and calculation of appropriate signal features;
ranking of calculated features in terms of information they contain;
accepting appropriate classifiers;
evaluation of selected classifiers in the diagnostics of pump condition, in terms of choice of diagnostic signals and the applied feature ranking algorithm.

Selection of the classification system features

The following significant problem in the construction of the classification system is the selection of signal features on which the system will be based. The features of signals from the time and frequency domains were determined for each obtained matrix of the pump body vibration signals³⁶. The dimension of the vibration signal matrix for which the features were calculated was then 3 × 2010 measurement points. The procedure was the same in analysing signals from the static and dynamic pressure transducers installed in the pump pressure port. In this case, the dimension of the signal matrix for which the features were calculated was 2 × 2010 measurement points. In the case of the determination of time features of the measured signals, we defined their variability and the amount of information they contained. We used the statistical measures of location, concentration and variability. The list of features of the measured signals and relationships according to which they were calculated in presented in Table 1.³⁷

Table 1 The list of features of the measured signals.

Full size table

The frequency measures are those which are universally used in the description of the frequency domain signals: maximum power spectral density (PSD) and the frequency at which the PSD reaches its maximum.

Application of ANOVA in selection of significant features of measured signals

The calculated features of the signals are the carrier of information about the wear of the monitored pump. In general perception, the more information, the better the discriminating power of the method used to classify sets with different features^29,38. Although, theoretically, the number of calculated signal features (which constitute input data for the classifier) is unlimited, in practice, the aim is to obtain the minimum number of features that well describe the properties of the tested object. This is conducive to getting a compact model with a good fit.

To improve the efficiency of classifiers, it is required to remove correlated and irrelevant features of signals. This leads to a reduction in the dimensions of the feature matrix and allows for a reduction of the needed computing power. Moreover, the reduction of the input data reduces the model training time and prevents its overtraining when creating classifier models.

The available data reduction methods can be divided into those that only find the most important features while removing insignificant ones (Backward elimination, Forward selection and Random forests can be distinguished here) and those that combine features by means of an appropriate transformation reducing their dimension (Principal Component Analysis (PCA), Factor Analysis (FA), Linear Discriminant Analysis (LDA) belong to this group).

In the present article, an easily interpretable and well-documented ANOVA analysis of variance was used to rank the features of the measured signals.The analysis of variance is based on breaking up the total sum of squares (SST) of variance for all observation results into two components²⁸:

sum of squares describing variability within the group (SSE),
sum of squares describing variability between groups (SSR).

Mathematically, this operation can be presented as:

$$\begin{array}{*{20}c} {\mathop \sum \limits_{i = 1}^{k} \mathop \sum \limits_{j = 1}^{{n_{i} }} \left( {x_{ij} - \overline{x}} \right)^{2} = } & {\mathop \sum \limits_{i = 1}^{k} \mathop \sum \limits_{j = 1}^{{n_{i} }} \left( {x_{ij} - \overline{{x_{i} }} } \right)^{2} + } & {\mathop \sum \limits_{i = 1}^{k} \mathop \sum \limits_{j = 1}^{{n_{i} }} \left( {\overline{{x_{i} }} - \overline{x}} \right)^{2} } \\ {{\text{SST}}} & {{\text{SSE}}} & {{\text{SSR}}} \\ \end{array}$$

(1)

where n_i – number of data in the i-th group, $\overline{{x }_{i}}$ – arithmetic mean in the i-th group, $\overline{x }$ – arithmetic mean in the total n-element group, n- total number of independent observations ${x}_{ij}$ for $j=\mathrm{1,2},\cdots {n}_{i}$

$$\left( {n = \mathop \sum \limits_{i = 1}^{k} n_{i} } \right)$$

(2)

where k – total number of groups.

The data are the basis for verification of the following null hypothesis:

H₀

Means calculated in each group are equal:

$${H}_{0}: {m}_{1}={m}_{2}={\cdots m}_{k}$$

vs the alternative hypothesis ${H}_{1}$:

H₁

At least two means from the groups differ.

In the analysis of variance, the variance between groups (SSR) is compared with the variance within groups (SSE). If the ratio of variance between groups to variance within groups is significantly high, it is possible to reject the null hypothesis at the accepted level of significance α (in our calculations we used α = 0.05).

The verification is carried out using the statistical test F (Fisher-Snedecor test)^28,29:

$$F = \frac{{ SSR_{k - 1} }}{{SSE_{N - k} }} = \frac{MSR}{{MSE}}$$

(3)

where MSR – root mean square of deviations between groups:

$$MSR = \frac{SSR}{{k - 1}}$$

(4)

MSE – root mean square of deviations within group:

$$MSE = \frac{SSE}{{n - k}}$$

(5)

where k- number of groups; n- total number of independent observations.

Test F is used to rank each feature, rejecting the null hypothesis at the accepted level of significance. The low value of p – test statistics indicates that the analyzed feature is important for pump wear evaluation.

The ranking of significance obtained with ANOVA from the vibration and pressure signals is presented in Fig. 7 (vibration signals) and Fig. 8 (pressure signals).

From among 62 features which were obtained from the pump body vibration measurement, the five most significant ones were selected to evaluate pump wear. The ANOVA ranking of the most significant features is presented in Table 2.

Table 2 The ranking of the most significant features obtained from the pump body vibration measurement.

Full size table

where: YR_CrestFactor – crest factor of vibration signal measured on the valve plate in axis Y, YR_ImpulseFactor – impulse factor of vibration signal measured on the valve plate in axis Y, YR_ClearanceFactor – clearance factor calculated from the vibration signal measured on the valve plate in axis Y, ZT_PeakFreq – frequency of maximum power spectral density PSD of vibration signal measured on the valve plate in axis Z, ZT_Kurtosis – kurtosis of vibration signal measured on the valve plate in axis Y.

From among 22 features obtained from the measurement of static and dynamic pressures on the pump pressure port, the five most significant ones were selected. The features which satisfy the conditions are presented in Table 3.

Table 3 The ranking of the most significant features obtained from the static and dynamic pressures measurement.

Full size table

Selection of classification algorithm

The learning systems are increasingly used in maintenance engineering for modelling a state of an industrial process or its component (e.g. a machine) exclusively based on available measurement data assigned to the process (class). In terms of learning techniques, the learning systems are divided into supervised and unsupervised^27,39.

Both supervised and unsupervised machine learning systems offer a large group of learning algorithms and the Choice of the most appropriate one depends on many factors. Firstly, in order to choose the appropriate learning algorithm, we need to specify the task that the model will perform accurately (classification, regression, grouping). The next issue is the type and size of input data which affect the learning speed, load on the computer (controller) memory and accuracy of output data prediction (model answers). The choice of appropriate classification algorithm is not clear-cut and only an experienced operator (a long-standing researcher) can quickly indicate the appropriate algorithm. Usually, the best classification algorithm is selected in multiple trials of individual types and evaluation of the obtained classifiers in terms of speed of operation, accuracy of classification and load on the memory of the computing unit.

Among the algorithms which meet the issue²⁵ of the multi-piston pump condition classification we can find:

Decision trees

In this algorithm, data classification is performed using the decision tree based on the starting point and branches forming a binary decision making system whose end branches are the result of assigning the data to a class.

Discriminant analysis

It is based on the analysis of the Gaussian distribution of signals from the set of observations (inputs). The classifier estimates the Gaussian distribution parameters from observations and based on that assigns to an appropriate class.

Support vector machines

Classification of data by finding the best hyperplane which separates the data of one class from the data of the other class. The best hyperplane is the one which separates the data with the largest margin.

K Nearest Neighbours classifiers

It determines the membership of new data from the input set in a specific class based on the location of an assumed number (number K) of the nearest (neighbouring) data of the input set data relative to that data. The measure of location is measure of distance of the classified data from the neighbouring data.

Naive Bayes classifiers

A probabilistic classifier which (naively) assumes mutual independence of input data. Using the Bayes’ theorem, this classifier calculates the probability of data membership in a specific class.

Before starting the verification of the suitability of the aforementioned classifiers in the evaluation of pump wear, it was assumed that each classifier using 5 most significant features of measured signals evaluates the pump condition with the same accuracy as the classifier based on previously determined all features of the measured signals (i.e. 64 features determined from vibration signals or 22 features determined form pressure signals).

In order to confirm this assumption, the following null hypotheses should be assumed and verified:

Hypothesis H ₀₁

The full model using all 64 features of the measured vibration signals classified the wear condition of a multi-piston pump with the same accuracy as a simplified model using 5 most significant features.

And

Hypothesis H ₀₂

The full model using all 22 features of the measured pressure signals classified the wear condition of a multi-piston pump with the same accuracy as a simplified model using 5 most significant features.

The verification of hypotheses H₀₁ and H₀₂ was carried out for models using the K Nearest Neighbors algorithm. We used a multiple repetition (5 × 2) t-Student test with a random division of signals. The measure of the accuracy of pump wear classification by the full model and the simplified model was fit error factor e specified by the equation:

$$e = \frac{{\mathop \sum \nolimits_{j = 1}^{{n_{test} }} w_{j} I\left( {\widehat{{p_{{{\text{1j}}}} }} \ne y_{j} } \right)}}{{\mathop \sum \nolimits_{j = 1}^{{n_{test} }} w_{j} }}$$

(6)

where: n_test – number of observations; w_j – weight of the j-th observation; I(x) – function marker; is 1 for true assumption, otherwise is 0; $\widehat{{\text{p}}_{\text{1j}}}$ – identified pump condition for the first model in the jth observation; $y_{j}$ – actual pump condition in the j-th observation.

In the comparison of the pump condition classification by the full and simplified models (for features calculated from vibration signals) the evaluation of hypothesis H₀₁ was zero (h = 0 at probability p = 0.57). This indicates that hypothesis H₀₁ cannot be rejected, and, consequently, it is possible to use the simplified model based on the 5 most significant features of vibration signals.

In the comparison of the pump condition classification by the full and simplified models (for features calculated from pressure signals) the evaluation of hypothesis H₀₂ was also zero (h = 0 at probability p = 0.1). Also in this case, the hypothesis H₀₂ cannot be rejected, and, consequently, it is possible to use the simplified model based on the 5 most significant features of pressure signals.

An example of estimation of the classification errors by models using the features calculated from the pressure signals for the multiple repetitions (5 × 2) t-Student test is presented in the Table 4 and Table 5.

Table 4 Calculated errors e1 of pump wear classification for the full model using 22 features from pressure signals.

Full size table

Table 5 Calculated errors e2 of pump wear classification for the simplified model using 5 most significant features from pressure signals.

Full size table

Pump wear classification

The pump wear classification was carried out using models based on 5 most significant features of the measured vibration and pressure signals. The following classification algorithms were used^37,38:

Decision trees;
Discriminant analysis;
K Nearest Neighbours classifiers;
Naive Bayes classifiers;
Support vector machines.

The models' accuracy of the pump condition identification was verified using cross-validation. The input data (calculated values of signal features) were divided into five disjoint sets which were used as test sets, and the remaining used as the learning set. Next the mean Error was calculated. The correctness of pump condition identification by each model was presented graphically as a Confusion Matrix. Figure 9 shows the classification error matrices for the models which as input data use the calculated features from pressure measurements.

Table 6 includes a comparison of properties used to evaluate the pump condition, by the following criteria:

Classification accuracy;
Learning time;
Classification speed;
Misclassification costs.

Table 6 Comparison of properties of classifiers used to evaluate pump wear.

Full size table

Similarly to the models based on the features obtained from pressure signals, the correctness of the pump condition classification by the models using the most significant features determined from vibration signals was estimated using a Confusion Matrix and is shown in Figs. 10 and 11.

The accuracy of pump condition classification by models and their main features are presented in Table 7.

Table 7 The accuracy of pump condition classification by models and their main features.

Full size table

The comparison of model properties presented in Tables 6 and 7 indicates that models that use the features from pressure signals had better classification accuracy for each used classification algorithm than the models using the features calculated from vibration signals, while the prediction speed and model learning time were comparable. The 100% accuracy of the pump condition detection was obtained for models using the algorithm K Nearest Neighbours KNN (regardless of the types of features). The Fine Decision tree models also had high prediction accuracy, but the learning time was on average three times longer than in KNN. On the other hand, the models using Support vector machines SVM classified the pump condition with identical accuracy, regardless of input signal type. The models using the Naive Bayes classifiers and Discriminant analysis classifiers had relatively the lowest accuracy of pump condition prediction which was about 70% for the measured pressure input signals and 65% for the measured vibration signals. The classifiers had the largest error in the detection of the pump transition state (end of life label), most frequently identifying it as “pump operable”.

A physical implementation of a diagnostic system based on the machine code generated from the selected classification model requires a selection of a model which offers the highest prediction of pump condition and, at the same time, has the least classification error. Based on the comparison of model accuracies presented in Tables 6 and 7 we can say that this condition is satisfied by all K Nearest Neighbours models KNN and the Weighted KNN model (for both pressure and vibration signals). In addition, the Fine Decision tree model has the 100% accuracy for features from the measured pressure and vibration signals. Further analysis of the models mentioned above involved verification of their accuracy for newly measured pressure and pump body vibration signals at three states of operating condition.

Table 8 presents the accuracies with which the flowing models exported to the Matlab space³⁵: Fine KNN, Weighted KNN and Fine Decision tree recognized the pump condition classes based on the previously calculated features.

Table 8 The accuracies of the models.

Full size table

An example of the kurtosis distribution as a function of standard deviation of the dynamic pressure signal which was obtained using the verified Fine Decision tree model is shown in Fig. 12.

Summary

A careful preparation of input data is a key factor in modern systems that use machine learning to classify a monitored machine's wear state. The paper presents a concept of such system using a displacement pump as a diagnosed machine and operational and vibration signals as a source of information. While diagnosing displacement pumps, such data may come from operation signals such as static and dynamic pressures measured on the pump pressure port and additional signals, such as vibration signals recorded in characteristic locations on the pump body. In both cases, the input data sets containing measured signals should be as numerous as possible. In diagnostics of displacement pumps, the input data must include signals in the entire range of the pump operation, i.e. at varying working pressures and with viscosity of the working medium which varies with changing temperature. This results in a better training of the obtained classifier model and better effectiveness of the pump wear classification. The next important issue is the selection of such signal features that best determine the classes of pump condition or damage. It is desirable to determine the minimum number of features which may affect the classifier learning time during the classification process and prevent over-training. ANOVA used in the ranking of the determined features effectively arranged the features coming from signals of both vibration and pressure. The five most significant features were used to evaluate the effectiveness of the accepted pump condition classification algorithms. The obtained results unequivocally indicate that, regardless of the type of the used input features, the best classification accuracy was obtained using the K Nearest Neighbours models KNN (Fine KNN Weighted KNN models) and the Decision tree models (Fine Tree models and the Medium Tree model). Support vector machines models (Cubic and Fine Gaussian models) showed slightly worse classification accuracy. In the case of physical implementation in a diagnostic system, in addition to the accuracy with which a given model is able to predict the wear and tear of a pump, the time required to train it must be taken into account. Assuming that the results obtained on a limited number of measured signals can be related to the real operation of the system, the obtained K Nearest-Neighbours KNN models were trained on average three times faster than the other models that provided equally high prediction accuracy (Decision tree models and Support vector machines models). In turn, the best model recognition speed expressed as the number of recognized observations per second (obs/s) was achieved by the Discriminant analysis models with a lower accuracy of pump state classification.

The authors plan to continue their research on the application of deep machine learning and machine learning systems to the diagnosis of the wear state of positive displacement pumps. At the same time, they would like to extend this research to the detection of the undesirable phenomenon of cavitation, which causes erosive wear of both the suction port and pump components. Based on the results already obtained using machine learning (e.g. the example verification of the Fine Decision tree model shown in Fig. 12), the authors plan to use them to build an advisory system. Such a system would be aimed at hydraulic system operators. The idea is that the system would make use of pressure signals measured at the pump output, from which a decision would be made about the wear state of the pump and the required service.

Data availability

The datasets used during the current study available from the corresponding author on reasonable request.

References

Watton, J. Modelling, monitoring and diagnostic techniques for fluid power systems (Springer, 2007).
Google Scholar
Stojek, J. Application of time-frequency analysis for diagnostics of valve plate wear in axial-piston pump. Arch. Mech. Eng. 57(3), 309–322 (2010).
Article Google Scholar
Jabłoński, A. Condition Monitoring Algorithms in MATLAB (Springer International Publishing, Berlin, 2021).
Book Google Scholar
Roberts, M. J. Signals and Systems Analysis Using Transform Methods and MATLAB (McGraw-Hill Higher Education, 2004).
Google Scholar
Goharrizi, A. Y. & Sepehri, N. A wavelet-based approach for external leakage detection and isolation from internal leakage in valve-controlled hydraulic actuators. IEEE Trans. Ind. Electron. 58(9), 4374–4384 (2011).
Article Google Scholar
Grewal, M. S. & Andrews, A. P. Kalman Filtering Theory and Practice Using MATLAB (Wiley, New York, 2008).
Book Google Scholar
Dabrowska, A., Stetter, R., Sasmito, H., Kleinmann, S.: Extended Kalman filter algorithm for advanced diagnosis of positive displacement pumps. In A 8th IFAC Symposium on Fault Detection, Supervision and Safety of Technical Processes (SAFEPROCESS) August 29–31, 2012. Mexico City, Mexico.
Bensaad, D., Soualhi, A. & Guillet, F. A new leaky piston identification method in an axial piston pump based on the extended Kalman filter. Measurement 148, 106921 (2019).
Article Google Scholar
Asl, R. M., Hagh, Y. S., Simani, S. & Handroos, H. Adaptive square-root unscented Kalman filter: An experimental study of hydraulic actuator state estimation. Mech. Syst. Signal Process. 132, 670–691 (2019).
Article ADS Google Scholar
Bahrami, M., Naraghi, M. & Zareinejad, M. Adaptive super-twisting observer for fault reconstruction in electro-hydraulic systems. ISA Trans. 76, 235–245 (2018).
Article PubMed Google Scholar
Wang, D. et al. Wear analysis of slideway in emulsion pumps based on finite element method. Sci. Rep. 14, 1930 (2024).
Article ADS CAS PubMed PubMed Central Google Scholar
Ambrożkiewicz, B. et al. Intelligent diagnostics of radial internal clearance in ball bearings with machine learning methods. Sensors. 23(13), 5875 (2023).
Article ADS PubMed PubMed Central Google Scholar
Xiong, Z., Han, C. & Zhang, G. Fault diagnosis of anti-friction bearings based on Bi-dimensional ensemble local mean decomposition and optimized dynamic least square support vector machine. Sci. Rep. 13(1), 17784 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Yang, J., Xie, G., Yang, Y., Zhang, Y. & Liu, W. Deep model integrated with data correlation analysis for multiple intermittent faults diagnosis. ISA Trans. 95, 306–319 (2019).
Article PubMed Google Scholar
Hajnayeb, A., Ghasemloonia, A., Khadem, S. E. & Moradi, M. H. Application and comparison of an ANN-based feature selection method and the genetic algorithm in gearbox fault diagnosis. Exp. Syst. Appl. 38(8), 10205–10209 (2011).
Article Google Scholar
Pan, Z., Meng, Z., Chen, Z., Gao, W. & Shi, Y. A two-stage method based on extreme learning machine for predicting the remaining useful life of rolling-element bearings. Mech. Syst. Signal Process. 144, 106899 (2020).
Article Google Scholar
De la, F. A., Crespo, M. A., Candón, E., Gómez, J. F. & Serra, J. A comparison of machine learning techniques for LNG pumps fault prediction in regasification plants. IFAC-PapersOnLine 53(3), 125–130 (2020).
Article Google Scholar
Ding, Y. et al. An EWT-PCA and extreme learning machine based diagnosis approach for hydraulic pump. IFAC-Papers On Line 53(3), 43–47 (2020).
Article Google Scholar
Lan, Y. et al. Fault diagnosis on slipper abrasion of axial piston pump based on Extreme Learning Machine. Measurement 124, 378–385 (2018).
Article ADS Google Scholar
Konieczny, J. & Stojek, J. Use of the K-nearest neighbour classifier in wear condition classification of a positive displacement pump. Sensors 21(18), 6247 (2021).
Article ADS PubMed PubMed Central Google Scholar
Jegadeeshwaran, R. & Sugumaran, V. Fault diagnosis of automobile hydraulic brake system using statistical features and support vector machines. Mech. Syst. Signal Process. 52, 436–446 (2015).
Article ADS Google Scholar
Tang, S., Zhu, Y. & Yuan, S. Intelligent fault diagnosis of hydraulic piston pump based on deep learning and Bayesian optimisation. ISA Trans. 129, 555–563 (2022).
Article PubMed Google Scholar
He, Y., Tang, H., Ren, Y. & Kumar, A. A deep multi-signal fusion adversarial model based transfer learning and residual network for axial piston pump fault diagnosis. Measurement 192, 110889 (2022).
Article Google Scholar
Wang, S., Xiang, J., Zhong, Y. & Tang, H. A data indicator-based deep belief networks to detect multiple faults in axial piston pumps. Mech. Syst. Signal Process. 112, 154–170 (2018).
Article ADS Google Scholar
Zhu, Y. et al. Intelligent fault diagnosis of hydraulic piston pump combining improved LeNet-5 and PSO hyperparameter optimization. Appl. Acoust. 183, 108336 (2021).
Article Google Scholar
Zhao, R. et al. Deep learning and its applications to machine health monitoring. Mech. Syst. Signal Process. 115, 213–237 (2019).
Article ADS Google Scholar
Joshi, A. V. Machine Learning and Artificial Intelligence (Springer Nature, Berlin, 2020).
Book Google Scholar
Esfandiari, R. S. Numerical Methods for Engineers and Scientists Using MATLAB (CRC Press, Boca Raton, 2017).
Google Scholar
Kroese, D. P., Botev, Z. I., Taimre, T. & Vaisman, R. Data Science and Machine Learning: Mathematical and Statistical Methods (CRC Press, Boca Raton, 2017).
Google Scholar
Merritt, H. E. Hydraulic Control Systems (Wiley, New York, 1967).
Google Scholar
Manring, N. Fluid Power Pumps and Motors: Analysis, Design and Control (McGraw Hill Professional, New York, 2013).
Google Scholar
Ma, J., Chen, J., Li, J., Li, Q. & Ren, C. Wear analysis of swash plate/slipper pair of axis piston hydraulic pump. Tribol. Int. 90, 467–472 (2015).
Article Google Scholar
Totten, G. E. & DeNegri, V. J. Handbook of Hydraulic Fluid Technology (CRC Press, New York, 2017).
Google Scholar
Awrejcewicz, J. & Krysko, V. A. Chaos in Structural Mechanics (Springer, 2008).
Book Google Scholar
Leis, J. W. Digital Signal Processing Using MATLAB for Students and Researchers (Wiley, New York, 2011).
Book Google Scholar
Bin, G. F., Gao, J. J., Li, X. J. & Dhillon, B. S. Early fault diagnosis of rotating machinery based on wavelet packets. Empirical mode decomposition feature extraction and neural network. Mech. Syst. Signal Process. 27, 696–711 (2012).
Article ADS Google Scholar
Sharma, V. & Parey, A. A review of gear fault diagnosis using various condition indicators. Procedia Eng. 144, 253–263 (2016).
Article Google Scholar
Bhattacharyya, S., Bhaumik, H., Mukherjee, A. & De, S. Machine learning for a big data analysis (Walter de Gruyter, Berlin, 2019).
Google Scholar
Lalik, K., Kozek, M. & Dominik, I. Autonomous machine learning algorithm for stress monitoring in concrete using elastoacoustical effect. Materials 14(15), 4116 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar

Download references

Funding

Research funded by Department of Process Control, AGH University of Krakow.

Author information

Authors and Affiliations

Department of Process Control, Faculty of Mechanical Engineering and Robotics, AGH University of Krakow, al. A. Mickiewicza 30, 30‐059, Krakow, Poland
Jarosław Konieczny & Jerzy Stojek
Department of Applied Mechanics and Biomechanics, Faculty of Mechanical Engineering, Cracow University of Technology, al. Jana Pawla II 37, 31-864, Krakow, Poland
Waldemar Łatas

Authors

Jarosław Konieczny
View author publications
You can also search for this author in PubMed Google Scholar
Waldemar Łatas
View author publications
You can also search for this author in PubMed Google Scholar
Jerzy Stojek
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors equally contributed to the article.

Corresponding author

Correspondence to Jerzy Stojek.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Konieczny, J., Łatas, W. & Stojek, J. Application of analysis of variance to determine important features of signals for diagnostic classifiers of displacement pumps. Sci Rep 14, 6098 (2024). https://doi.org/10.1038/s41598-024-56498-0

Download citation

Received: 18 September 2023
Accepted: 07 March 2024
Published: 13 March 2024
DOI: https://doi.org/10.1038/s41598-024-56498-0

Keywords

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Using energy time–frequency of Hilbert Huang transform to analyze the performance of the variable valve timing engine

A degradation feature extraction technique based on static divided symbol sequence entropy

Displacement of mining vibrating screen obtained from acceleration based on improved S–G filter

Introduction

Tested object

Experimental part

Selection of the classification system features

Application of ANOVA in selection of significant features of measured signals

H0

H1

Selection of classification algorithm

Decision trees

Discriminant analysis

Support vector machines

K Nearest Neighbours classifiers

Naive Bayes classifiers

Hypothesis H 01

Hypothesis H 02

Pump wear classification

Summary

Data availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Comments

Search

Quick links

H₀

H₁

Hypothesis H ₀₁

Hypothesis H ₀₂