Detection of incipient rotor unbalance fault based on the RIME-VMD and modified-WKN

Wang, Qian; Hu, Shuo; Wang, Xinya

doi:10.1038/s41598-024-54984-z

Download PDF

Article
Open access
Published: 26 February 2024

Detection of incipient rotor unbalance fault based on the RIME-VMD and modified-WKN

Qian Wang^1,2,
Shuo Hu¹ &
Xinya Wang²

Scientific Reports volume 14, Article number: 4683 (2024) Cite this article

293 Accesses
8 Altmetric
Metrics details

Subjects

Abstract

Due to the high incidence and inconspicuous initial characteristics of rotor unbalance faults, the detection of incipient unbalance faults is becoming a very challenging problem. In this paper, a new method of small rotor unbalance fault diagnosis based on RIME-VMD and modified wavelet kernel network (modified-WKN) is proposed. Firstly, in order to extract the small unbalance fault information from the vibration signals with low signal-to-noise ratio (SNR) more efficiently, the RIME algorithm is used to search for the optimal location of the penalty factor and decomposition layer in the variable mode decomposition (VMD). Secondly, the most relevant decomposition components to the small unbalance fault information are selected by using Pearson Correlation Coefficients and utilized to reconstruct the signal. Finally, the modified-WKN diagnostic model that is used for multi-sensor data fusion is constructed. The model can acquire features of vibration signals from multiple position sensors, which enhances the ability of the modified WKN diagnostic model to deal with incipient fault modes. Based on the experimental analysis of rotor unbalance fault datasets with different SNRs, it is verified that the detection performance of the proposed method is better than the traditional WKN and VMD-WKN methods. Specifically, the proposed method is more sensitive to the initial unbalance faults.

Fault diagnosis of anti-friction bearings based on Bi-dimensional ensemble local mean decomposition and optimized dynamic least square support vector machine

Article Open access 18 October 2023

Fault diagnosis of gearbox based on Fourier Bessel EWT and manifold regularization ELM

Article Open access 02 September 2023

Denoising method of machine tool vibration signal based on variational mode decomposition and Whale-Tabu optimization algorithm

Article Open access 27 January 2023

Introduction

As the core component of rotating machinery, the rotor system has been extensively applied in the aerospace, petrochemical, coal, and electricity industries^1,2. The primary faults in the rotor system include rotor unbalance, misalignment, rub-impact, and others³. Among these, rotor unbalance is a significant cause of instability in rotor systems. In practical engineering, the early characteristics of rotor unbalance fault signals are relatively weak. Additionally, they are always accompanied by noise and other uncertainties, leading to even weaker features. Therefore, the rapid and accurate detection of initial rotor unbalance faults is a very challenging diagnostic problem and is also a crucial safeguard for the long-term safe and stable operation of rotating machinery systems.

In the past few decades, the diagnosis methods for rotor faults can be divided into two categories⁴: one is time-frequency fault diagnosis methods, such as the wavelet transform, variational modal decomposition (VMD), and others^5,6,7. The other is knowledge-based fault diagnostic methods, which includes support vector machines, expert systems, dynamic learning, and deep learning^8,9,10,11. Currently, deep learning-based fault diagnosis methods have become a research hotspot, and various advanced learning models (CNN, LSTM, DBN, AE, and others) are widely utilized in the field of rotor fault diagnosis^{12,13,14,15,16,17}.

Among these deep learning models, CNNs stand out for their exceptional performance in fault diagnosis¹⁸. However, the CNN-based models are often considered as black boxes due to the lack of interpretability. With the advancement of learning methods, various approaches have been proposed to enhance interpretability. With the advancement of learning methods, various approaches have been proposed to enhance interpretability. Zilke et al.¹⁹ developed a novel scheme for neural network rule extraction based on decision trees to investigate the decision-making process. Grezmak et al.²⁰ utilized layer-by-layer correlation propagation as an indicator to elucidate the key features learning process of CNN from time-frequency spectrum images. Jia²¹ employed a Neuron Activation Maximization algorithm to visualize the kernels of convolutional layers, aiming to comprehend the process of feature learning. Chen²² applied Gradient Class Activation Mapping to generate an attention model and explained the model by analyzing attention matters. Li et al.²³ introduced an attention mechanism to assist deep neural networks in focusing on critical data segments, and the learned fault diagnosis characteristics can be presented in a visualized manner.

It should be noted that the rotor fault diagnosis signals are vibration data, and the above-mentioned methods are mainly suitable for processing two-dimensional image data. In order to solve this problem, Li et al.²⁴ proposed an interpretable model known as the Wavelet-Kernel Network (WKN), which is suitable for dealing with the vibration fault signals. The wavelet transformation is employed in the first convolutional layer of CNN, and the physical significance of wavelet transformation is taken as the interpretability process. However, WKN is implemented based on a single sensor signal and cannot fully capture the fault information concealed within the noise in the rotor vibration signals.

Inspired by the WKN method, this paper proposes a new rotor unbalance fault diagnosis method based on a RIME-VMD and modified-WKN. Firstly, to extract the initial unbalance fault information accurately under the condition of complex noises, the VMD decomposition algorithm is employed to decompose the vibration signals. In addition, the RIME algorithm²⁵ is used to search for the optimal combination of penalty factor $ \alpha $ and decomposition layer k of VMD. Secondly, the obtained optimal IMF components are selected by using the Pearson Correlation Coefficient (PCC), and the most relevant fault IMF components are used for the signal reconstruction. Thirdly, a new multi-head convolutional layer of the WKN is constructed to capture rotor unbalance fault information comprehensively based on the multiple vibration data from different positions in the rotor system. Additionally, this paper adopts multi-scale convolution to extract fault information of various scales in the fused features, which enhances the ability to perceive complex patterns. Finally, the diagnostic performance of the proposed method is illustrated based on the experimental analysis with varying SNRs. The results demonstrate that it is better than the traditional WKN method and the WKN combined with VMD (VMD-WKN) methods. Specifically, the proposed method is more sensitive to the initial unbalance faults.

The main contributions of this article are shown as follows:

1)
The most relevant unbalance fault IMF components are obtained according to the parameter-optimized VMD method, and the optimal combination of penalty factor $\alpha $ and number of decomposition layers k is automatically searched by embedding the RIME algorithm.
2)
Different from the WKN in Ref.²⁴, the modified-WKN fault diagnosis model was constructed by fully considering vibration data from different positions. The rotor unbalance fault information from the rotor vibration signals concealed within the noises are fully captured by using the multi-head convolution and multi-scale convolution structures. Compared with WKN using single sensor data, the modified-WKN model exhibits greater sensitivity to the small initial unbalance fault information.

The remaining sections of this article are structured as follows. "Theoretical basis" section covers the theoretical basis, introducing the theoretical foundation of the RIME algorithm and WKN. "Proposed method" section introduces the structures of the parameter-optimized VMD and modified-WKN models. "Data acquisition" section presents the rotor fault test bench and unbalance datasets. "Experimental results" section covers the experiments, and the conclusion presented in "Conclusion" section.

Theoretical basis

RIME optimization algorithm

The Rime algorithm²⁵ was proposed by Huang in 2023, which is an intelligent search method by simulating the growth process of rime in nature. The Rime algorithm can be divided into the following four phases.

In the first stage, the entire rime population R is initialized. The parameter R can be expressed by the following equation:

$$\begin{aligned} R=\left( \begin{array}{c} S_{1} \\ S_{2} \\ \vdots \\ S_{i} \end{array}\right) ; S_{i}=\left[ \begin{array}{llll} x_{i 1}&x_{i 2}&\cdots&x_{i j} \end{array}\right] \end{aligned}$$

(1)

where $ S_{i} $ represents the ith rime agent; $ x_{ij} $ denotes the jth rime particle within this agent.

The second stage is called the Soft-rime Search mechanism. The mechanism simulates the random diffusion and large area coverage of rime particles in a weak wind environment. The updated position of rime particles $R_{ij}^{\text {new}} $ in a weak wind environment can be expressed by the following equation:

$$\begin{aligned} R_{ij}^{\text {new}} = R_{\text {best}, j} + r_{1} \cdot \cos \varphi \cdot \beta \cdot \left( h\left( Ub_{ij} - Lb_{ij}\right) + L b_{ij}\right) , \quad r_{2} < E \end{aligned}$$

(2)

where $R_{i j}^{\text{ new }}$ is the updated position of the rime particle, $R_{\text{ best,j }}$ is the jth particle of the best rime agent in the population R. A random number h is used to control the center distance between two rime particles, which has the value in the range of (0, 1). The parameter $Ub_{ij}$ and the parameter $Lb_{ij}$ are the top and bottom bounds of the escape space, respectively. The movement direction of rime particles is influenced by the random variable $r_{1}$, where $r_{1} \in (0,1)$. Both $r_{1}$ and $\cos \varphi $ vary with the iteration count. The $\varphi $ is an angle over time that is affected by the current number of iterations t and the maximum number of iterations of the algorithm T. The mathematical expression for $\varphi $ is as follows:

$$\begin{aligned} \varphi = \pi \cdot \frac{t}{10 T} \end{aligned}$$

(3)

The mathematical model for the environmental factor $ \beta $ is a step function. In the Soft-rime search strategy, $ \beta $ is utilized to simulate the impact of the external environment. The mathematical expression for $ \beta $ is given by:

$$\begin{aligned} \beta = 1 - \left[\frac{\omega \cdot t}{T} \right] / {\omega } \end{aligned}$$

(4)

where the parameter $ [\cdot ] $ indicates rounding; the default setting of the parameter $\omega $ is 5, which is used to regulate the number of segments of the step function.

Parameter E represents the coefficient of being attached; $r_{2}$ is a random number. The E controls particle position update along with $r_{2}$. The range of values for $r_{2}$ is detailed in the Ref.²⁶. The E can be expressed with the following equation:

$$\begin{aligned} {E=\sqrt{(t / T)}} \end{aligned}$$

(5)

The third stage is known as the Hard-rime puncture mechanism. It promotes the exchange of information between ordinary agents and optimal agents by stimulating the growth of rime in strong wind conditions, thereby improving the precision of algorithmic solutions. The replacement equation for the particle position in the strong wind condition can be expressed by the following equation:

$$\begin{aligned} R_{ij}^{\text {new}} = R_{\text {best}, j}, \quad r_{3} < F^{\text {normr}}\left( S_{i}\right) \end{aligned}$$

(6)

where $r_{3}$ is a random number with a range of (-1,1); $F^{\text{ normr }}\left( S_{i}\right) $ is the normalized fitness value.

The fourth stage is called the Positive Greedy Selection Mechanism. If the updated fitness value is superior to the previous value, the agent’s fitness value and solution are replaced.

Theoretical basis of WKN

The Wavelet transform is a time-frequency analysis method that includes Continuous Wavelet Transform (CWT) and Discrete Wavelet Transform (DWT). Different from the Fourier transform, the basis function of the wavelet transform is a wavelet basis with finite length and attenuation. The Continuous Wavelet Convolution((CWConv) layer is implemented by utilizing the similarity between CNN convolution operations and CWT operations.

The convolution operation performed by the convolution kernel on the input signal when it passes through the convolutional layer of the CNN can be considered as an inner product operation. This process can be represented by the following equation:

$$\begin{aligned} h = W \otimes x + b \end{aligned}$$

(7)

where x represents the current input data; h is denoted as the feature map obtained after convolutional computation; $\otimes $ denotes the convolution operator; W stands for the convolutional kernel weight; b represents the bias.

Similarly, the process of CWT can be viewed as the inner product operation between the input signal and the wavelet basis functions. The continuous wavelet transform of the signal $X(t)$ can be expressed as follows:

$$\begin{aligned} {\text {CWT}}_{f}(u, s)=\left\langle X, \psi _{u, s}(t)\right\rangle =\frac{1}{\sqrt{s}} \int X(t) \psi ^{*}\left( \frac{t-u}{s}\right) dt \end{aligned}$$

(8)

where $ s $ is the scale parameter; $ u $ is the translation parameter; $ t $ is the time parameter; $ x(t) $ is the input signal; $ \psi ^{*} $ is the complex conjugate of the wavelet basis function $ \psi _{\textrm{u}, \textrm{s}} $. The wavelet basis functions $ \psi _{\textrm{u}, \textrm{s}} $ can be expressed by the following equation:

$$\begin{aligned} \psi _{u, s}(t)=\frac{1}{\sqrt{s}} \psi \left( \frac{t-u}{s}\right) \end{aligned}$$

(9)

In summary, the CWConv layer was designed based on the principle of CWT. Subsequently, the first convolutional layer of the CNN was replaced with CWConv to construct the WKN. The CWConv layer was designed to introduce interpretability to the model. The convolution operation performed by the CWConv layer on the input signal can be expressed by the following equation:

$$\begin{aligned} H=\psi _{u, {~s}}({t}) * {~g}({x}) \end{aligned}$$

(10)

where $H$ represents the feature values output by the CWConv layer; $g(x)$ represents the input signal; $*$ denotes the convolution operation.

The core part of the WKN is the selection of the wavelet kernel basis functions in the CWConv layer. In Ref. ²⁴, it has been proved that the Laplace wavelet employed in the WKN has the best performance in rotating machinery fault diagnosis. The structure of WKN is illustrated in Fig. 1, comprising the input layer, continuous wavelet convolutional layer, convolutional layer, fully connected layer, and output layer.

Proposed method

RIME-VMD

The VMD is a signal decomposition method with adaptive features. The detailed decomposition composition process of VMD can be found in Ref.²⁷. The implementation process of the VMD algorithm can be regarded as the solution process of the variational problem. The description of the constrained variational model is as follows:

$$\begin{aligned} \left\{ \begin{array}{l} \min _{u_k,\omega _k} \left\{ \sum _{k=1}^K{\left\| \partial _t\left[ \left( \delta (t)+\frac{j}{\pi t} \right) \times u_k(t) \right] e^{-j\omega _kt} \right\| _{2}^{2}} \right\} \\ \,\,\textrm{s}.\textrm{t}.\sum _{k=1}^K{u_k}(t)=f(t)\\ \end{array} \right. \end{aligned}$$

(11)

where $\partial _{t}$ represents the partial derivative with respect to ${t}; \delta (t)$ is the impulse function; $u_{k}(t)$ is the kth mode function; $\omega _{k}$ is the central frequency of the kth mode; f(t) is the original signal.

In order to solve the optimal solution of Equation (11), the quadratic penalty factor $\alpha $ and the Lagrange operator equation are introduced to transform the constrained variational problem into an unconstrained variational problem. The augmented Lagrangian quantity L can be expressed by the following equation:

$$\begin{aligned} \begin{aligned} \begin{array}{c} L\left( \left\{ \mu _{k}\right\} ,\left\{ \omega _{k}\right\} , \lambda \right) =\alpha \sum _{k}\left\| \partial _{t}\left[ \left( \delta (t)+\frac{j}{\pi t}\right) \mu _{k}(t)\right] e^{-j \omega _{k} t}\right\| _{2}^{2} \\ \quad +\left\| f(t)-\sum _{k} \mu _{k}(t)\right\| _{2}^{2}+\left\langle \lambda (t), f(t)-\sum _{k} \mu _{k}(t)\right\rangle \ \end{array} \end{aligned} \end{aligned}$$

(12)

where $ {L}(\cdot ) $ represents the augmented Lagrangian function; $ \alpha $ is the penalty factor; $ \lambda (t) $ is the Lagrange multiplier.

The method of alternate multiplication is used in order to obtain the optimal solution of Equation (12). The update equation for $ u_{k} $ and $ \omega _{k} $are as follows:

$$\begin{aligned} {\hat{u}}_{k}^{n_{1}+1}(\omega )=\frac{{\hat{f}}(\omega )-\sum _{i \ne k} {\hat{u}}_{i}(\omega )+{\hat{\lambda }}(\omega ) / 2}{1+2 \alpha \left( \omega -\omega _{k}\right) ^{2}} \end{aligned}$$

(13)

$$\begin{aligned} \omega _{k}^{n_{1}+1}=\frac{\int _{0}^{\infty } \omega \left| {\hat{u}}_{k}(\omega )\right| ^{2} \mathrm {~d} \omega }{\int _{0}^{\infty }\left| {\hat{u}}_{k}(\omega )\right| ^{2} \mathrm {~d} \omega } \end{aligned}$$

(14)

where $n_{1}$ represents the iteration number; ${\hat{u}}_{k}^{n_{1}+1}(\omega )$, ${\hat{f}}(\omega )$, ${\hat{u}}_{k}(\omega )$, and ${\hat{\lambda }}(\omega )$ are the Fourier transforms of $u_{k}^{n_{1}+1}(t)$, f(t), $u_{k}(t)$, and $\lambda (t)$, respectively.

When processing signals with the VMD algorithm, two key parameters need to be preset, namely, the penalty factor $\alpha $ and the number of decomposition layers K. Moreover, the VMD decomposition results are greatly influenced by these two key parameters²⁸. Therefore, selecting an appropriate combination of parameters is the key to processing signals using the VMD algorithm. In practical work, the values of K and $\alpha $ are typically estimated based on experience. However, due to the complexity of real signals, estimating the parameter values only empirically will not obtain optimal decomposition results. This will result in the inability to accurately extract weak incipient unbalance fault features from low signal-to-noise ratio signals. Therefore, the RIME algorithm is used in this paper to search for the optimal values of the parameter combinations K and $\alpha $. After the optimal values of K and $\alpha $ are obtained, the VMD method is then used to process the signal. Finally, the obtained optimal IMF components are selected by using the PCC, and the most relevant fault IMF components are used for the signal reconstruction.

The magnitude of the envelope entropy $E_{P}$ can reflect the sparsity property of the IMF component. When the decomposed IMF component contains more noise, the sparsity of this IMF component is weak, and the corresponding $E_{P}$ is larger. On the contrary, if the IMF component contains regular fault shocks, then the IMF component has strong sparsity, and the corresponding envelope has smaller entropy. Therefore, in this paper, the $E_{P}$ is chosen as the fitness function. The envelope entropy of the signal x(j) can be expressed by the following equation:

$$\begin{aligned} \left\{ \begin{array}{l} E_{P}=-\sum _{j=1}^{N} p_{j} \lg p_{j} \\ p_{j}=a(j) / \sum _{j=1}^{N} a(j) \end{array}\right. \end{aligned}$$

(15)

where N represents the length of the signal x(j), $p_{j}$ is the normalized form of a(j); a(j) is the envelope signal obtained by Hilbert demodulation of the signal x(j).

Modified-WKN

At present, rotating machinery is rapidly developing in the direction of intelligence, large-scale and complexity. However, due to the harsh operating environment and noise, it is difficult to fully capture the operating status of equipment with data from a single sensor^29,30,31. Therefore, in this paper, multiple sensors are utilized to collect vibration data of the rotor system from different directions (the sensor arrangement will be described in the "Data acquisition" section), and the modified-WKN for multi-source data fusion is constructed. The information from sensors at different locations is fused through a multi-head CWConv layer. Then, the fault information at different scales in multi-source features is captured by multi-scale convolution.

The structure of the modified-WKN proposed in this paper is shown in Fig. 2, which includes a multi-data source input layer, a multi-head CWConv layer, a multi-scale convolutional layer, the feature connection layer, a fully connected layer, and an output layer. Different from the WKN structure as shown in Fig. 1, the multi-head wavelet convolution layer is composed of two CWConv layers (with a convolution kernel size of $1 \times 16$) that use Laplace wavelets. The multi-scale convolution layer is composed of two convolution kernels of different sizes. The output features of the multi-head CWConv layer are connected together by Concat and input to the multi-scale convolutional layer.

Fault diagnosis method based on RIME-VMD and modified-WKN

The fault diagnosis process based on the RIME-VMD and modified-WKN proposed in this article is shown in Fig. 3. (1) The vibration data for the different directions of the rotor system is collected by sensors in several positions. (2) The collected vibration data of the rotor system in different directions are decomposed by the RIME-VMD. Then, the PCC is utilized to select the optimal IMF components which are rich in fault information. Finally, the selected optimal IMF components are used for signal reconstruction. (3) The reconstructed dataset is divided into the training set, the validation set, and the test set. (4) The model is trained with the training set and the validation set. The model is saved after training is completed. (5) The accuracy and generalization ability of the model was evaluated with a test set.

Data acquisition

The rotor unbalance experiment was conducted on a rotor system test bench. As shown in Fig. 4, the test bench consists of a motor, a motor speed control device, a signal conditioning device, the eddy current sensor, and an unbalance device. The eddy current sensors are located in the horizontal (Y direction) and vertical (X direction) directions, respectively. In this paper, different weights of counterweight screws are added to the unbalance device to simulate four degrees of unbalance faults.

In industrial production, the motors are usually operated at multiple rotational speeds to adapt to different work conditions and production scenarios. Therefore, in this paper, the motor speeds are set as 1200 rpm, 1600 rpm, and 1800 rpm to simulate the real rotor work condition. The sampling frequency was set as 5120 Hz, and the rotor fault test bench saved data every 2 s. In this paper, four different degrees of unbalance faults are preset on the rotor system fault test bench, namely: incipient unbalance (1.2 g counterweight), mild unbalance (2.5 g counterweight), moderate unbalance (3.6 g counterweight), and severe unbalance (5.0 g counterweight). In addition, the normal state (0 g counterweight) is also included, so there are five operating states in total. A total of 300 signal samples were collected for each condition, and the length of each signal sample was 10240. Therefore, the total number of samples in the rotor unbalance fault dataset is 1500 files, which contain different rotational speeds and counterweights, as shown in Table 1. The training set, validation set, and test set were divided as 6:2:2. Thus, the training set contains 900 training samples, and the test and validation set each contains 300 samples. The incipient unbalance fault in this paper refers to a rotor system that has just experienced an unbalance fault. At this stage, the signal characteristics are very similar to those under normal conditions, and these features are easily overwhelmed by noise^32,33.

In real industrial production environments, the collected data are often mixed with a large amount of unavoidable noise. However, the unbalance fault dataset collected on the rotor test bed contains less disturbing components, which are not representative of the real industrial environment. In order to make the test results realistic, this paper adds different levels of Gaussian noise (0 dB, $-2$ dB, $-4$ dB, $-6$ dB) to the original dataset to simulate the real industrial environment. The original vibration data of the incipient unbalance fault associated with the 1.2 g counterweight at 1600 rpm is shown in Fig. 5, where Fig. 5a shows the X direction, and Figure 5b shows the Y direction. The data with $-6$ dB noise added is shown in Fig. 6. The Fig. 6a shows the time-waveform from the X direction. The Fig. 6b shows the time-waveform from the Y direction.

Table 1 Rotor speeds and counterweights.

Full size table

Experimental results

Results based on the modified-WKN

In order to verify the superiority of the proposed modified-WKN model, the WKN-X (vertical direction data diagnostic model), WKN-Y (horizontal direction data diagnostic model), and modified-WKN were separately trained on the dataset with added Gaussian white noise. The WKN (WKN- X, WKN- Y) structure is shown in Table 2, and the modified-WKN structure is shown in Table 3.

Table 2 WKN structural parameters.

Full size table

Table 3 Modified-WKN structural parameters.

Full size table

During the training process, the Adam optimizer was used with a learning rate of 0.001 for all models. The average of the five training results was taken as the final result to minimize the effect of randomness. The training results after 30 epochs are shown in Table 4. The t-distributed stochastic neighborhood embedding (t-SNE) of the model classification results is shown in Fig. 7. By observing the data in Table 4, it can be concluded that the performance of the modified WKN is excellent in all four datasets with different signal-to-noise ratios. Specifically, the accuracy of modified-WKN is more than 17 percentage points higher than WKN on the -4 dB and -6 dB datasets. In addition, the stability of WKN-X and WKN-Y is relatively lower than the modified WKN.

Table 4 The accuracy of WKN and modified-WKN under different noise conditions.

Full size table

In order to observe the classification ability of the modified-WKN in a more detailed way, 1800 rpm, 1600 rpm and 1200 rpm were selected as the test data of the modified-WKN from 300 test sets. Subsequently, the output of the model was visualized using t-SNE. The classification results of modified-WKN are shown in Fig. 8. Although the modified WKN outperforms the WKN, its accuracy is still unsatisfactory in -4 dB and -6 dB noise environments. As shown in Fig. 8, the modified-WKN is unable to capture the weak incipient unbalance fault features hidden in the noise, resulting in more misclassifications of Class 0 and Class 1. Therefore, in order to minimize the interference of noise on the incipient weak unbalance signal, RIME-VMD and improved WKN based fault diagnosis is further investigated in the next subsection.

Results based on the RIME-VMD and modified-WKN

In order to verify the effectiveness of the RIME-VMD method, the RIME-VMD method is compared with the VMD method and the GWO-VMD method (Gray Wolf Algorithm, GWO) in this paper. The detailed procedure of GWO-VMD can be found in the Ref.²⁸. The detailed procedure of this experiment is as follows.

In the RIME-VMD method and the GWO-VMD method, the search range of the decomposition modulus number K is set as the range of [3, 10]; the search range of the penalty factor $\alpha $ is set as the range of [100, 2500]; the search agent quantity is set as 20. In the VMD method, the values of the number of decomposition layers K and the penalty factor $\alpha $ are set according to the signal processing experience. The X direction vibration signal depicted in Fig. 6a is used as the analysis sample in this experiment. After several rounds of iterations, the optimal parameter combination $(K,\alpha ) = (9, 560)$ is found by the RIME algorithm, which indicates that the number of decomposition layers K is 9, and the penalty factor $\alpha $ is 560. The GWO algorithm finds the optimal parameter combination $(K,\alpha ) = (8,2383)$, namely, the number of decomposition layers K is 8, and the penalty factor $\alpha $ is 2383. The value of the parameter combination $(K,\alpha )$ is set to (9, 1472) in the VMD method. The decomposition results of RIME-VMD, GWO-VMD, and VMD are shown in Fig. 9.

The PCC reflects the degree of correlation between two variables. The higher the value of the PCC, the higher the correlation between the two variables. As shown in Table 5, the correlation coefficients were categorized as no correlation, weak correlation, moderate correlation, significant correlation and strong correlation³⁵. In this experiment, the threshold for the PCC is established at 0.4. If the calculated correlation coefficient of the IMF component is higher than 0.4, the IMF component will be used for signal reconstruction. On the contrary, if the IMF component correlation coefficient is lower than 0.4, the IMF component will be discarded for signal noise reduction. The correlation coefficients of each IMF component were calculated using PCC, and the calculated results are shown in Fig. 10. The IMF components whose correlation coefficients are greater than the PCC threshold are selected for reconstructing the signal, and the reconstructed signal is shown in Fig. 11.

Table 5 Correlation coefficients and correlation.

Full size table

The training results for the reconstructed $-4$ dB and $-6$ dB datasets are shown in Table 6. Among them, WKN-X, WKN-Y, and Modified-WKN are the training results of the reconstructed dataset based on the RIME-VMD method. GWO-Modified-WKN is the training result of the reconstructed dataset based on the GWO-VMD method. VMD-Modified-WKN is the training result of the reconstructed dataset based on the VMD method. The diagnostic accuracies of both modified-WKN and WKN were significantly improved after extracting the unbalance fault information using the RIME-VMD method. However, it is of concern that the diagnostic performance of WKN still lags behind modified-WKN. Specifically, the modified-WKN fault diagnosis model achieves 99.03% and 99.45% accuracy on the reconstructed $-4$ dB and $-6$ dB datasets, respectively. In comparison, the accuracy of WKN-X and WKN-Y is 95.28% and 90.07% in the $-4$ dB dataset and 94.38% and 89.72% in the $-6$ dB dataset, respectively. From the perspective of model stability, the WKN exhibits significant fluctuations in diagnostic accuracy, showing poorer stability than the modified-WKN.

In view of extracting unbalance fault information methods, the RIME-VMD, the GWO-VMD, and the VMD methods all improve the diagnostic performance of Modified-WKN. However, among them, the diagnostic performance of Modified-WKN based on the RIME-VMD method is better. In addition, as shown in Fig. 12, GWO-Modified-WKN, VMD-Modified-WKN, and WKN based on single-sensor data (WKN-Y, WKN-X) are not able to effectively differentiate between the normal states and the incipient unbalance states.

Table 6 Accuracy of WKN and modified-WKN on reconstructed datasets.

Full size table

The physical meaning of WKN is reflected in the output features of the CWCov layer. This means that the interpretability of WKN can be expressed through the visualization of the feature maps of the CWConv layer. In addition, the degree of energy concentration on the feature map can also be used to observe the impact of noise on the model’s feature extraction. Therefore, in this paper, the feature maps of CWConv layers in different health states are visualized. The feature visualization on the $-6$ dB rotor unbalance fault data is shown in Fig. 13. The visualization of the features on the reconstructed$-6$ dB dataset based on the RIME-VMD method is shown in Fig. 14. In Figs. 13 and 14, the feature-length is represented by the horizontal axis, and the number of channels is represented by the vertical axis. Where (a) shows the normal state, (b) shows the incipient unbalance state, (c) shows the mild unbalance state, (d) shows the moderate unbalance state, and (e) shows the severe unbalance state. From Fig. 13, it can be observed that the concentration of energy in the feature map is poor. The reason is that the presence of noise interference causes a large number of fault features to be overwhelmed by noise. After extracting the most relevant fault features using the RIME-VMD method, the energy of the output feature map of the CWConv layer is very concentrated.

Conclusion

A new method of small unbalance fault diagnosis based on the RIME-VMD and modified-WKN is proposed in this paper. With the main contributions of the proposed method, the most relevent rotor fault information is extracted through the RIME-VMD, and the small incipient fault can be effectively detected by implementing the multi-head convolution and multi-scale convolution structure. According to the comparision of experiment results, it is demonstrated that the proposed method is more sensitive to the small incipient unbalance faults under the condition of noise. In the future study, the inner dynamics information of different rotor faults can can be ininvestigated combining with the proposed method to further increase the diagnosis performance and applicability.

Data availability

The data presented in this study are available on request from the corresponding authors.

References

Brito, L. C., Susto, G. A., Brito, J. N. & Duarte, M. A. An explainable artificial intelligence approach for unsupervised fault detection and diagnosis in rotating machinery. Mech. Syst. Signal Process. 163, 108105 (2022).
Article Google Scholar
Xiao, L., Yang, X. & Yang, X. A graph neural network-based bearing fault detection method. Sci. Rep. 13, 5286 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Liang, H. et al. Research on a quantitative fault diagnosis method for rotor rub-impact. Structural Health Monitoring (2023).
Nath, A. G., Udmale, S. S. & Singh, S. K. Role of artificial intelligence in rotor fault diagnosis: A comprehensive review. Artif. Intell. Rev. 54, 2609–2668 (2021).
Article Google Scholar
Wang, Y., Markert, R., Xiang, J. & Zheng, W. Research on variational mode decomposition and its application in detecting rub-impact fault of the rotor system. Mech. Syst. Signal Process. 60, 243–251 (2015).
Article ADS Google Scholar
Li, J., Lu, H., Feng, K., Liu, Y. & Zhao, Y. Research on a new diagnosis index for fixed-point rub-impact of rotor system. Eng. Fail. Anal. 125, 105394 (2021).
Article Google Scholar
Rahman, M. M. & Uddin, M. N. Online unbalanced rotor fault detection of an im drive based on both time and frequency domain analyses. IEEE Trans. Ind. Appl. 53, 4087–4096 (2017).
Article Google Scholar
Yuan, S.-F. & Chu, F.-L. Support vector machines-based fault diagnosis for turbo-pump rotor. Mech. Syst. Signal Process. 20, 939–952 (2006).
Article ADS Google Scholar
Wang, L. et al. Research on the rotor fault diagnosis method based on QPSO-VMD-PCA-SVM. Front. Energy Res. 10, 944961 (2022).
Article Google Scholar
Zhang, H. & Bai, Y. A smart diagnosis system based on automatic recognition of multiple rotor faults. Adv. Mech. Eng. 9, 1–12 (2017).
Google Scholar
Wang, Q., Wu, W., Zhang, F. & Wang, X. Early rub-impact fault detection of rotor systems via deterministic learning. Control. Eng. Pract. 124, 105190 (2022).
Article Google Scholar
Wisal, M. & Oh, K.-Y. A new deep learning framework for imbalance detection of a rotating shaft. Sensors 23, 7141 (2023).
Article ADS PubMed PubMed Central Google Scholar
Shu, Y., Zhang, W., Song, X., Liu, G. & Jiang, Q. DBF-CNN: A double-branch fusion residual CNN for diagnosis of induction motor broken rotor bar. IEEE Trans. Instrum. Meas. 72, 3536510 (2023).
Article Google Scholar
Yuhong, J., Lei, H., Yushu, C. & Zhenyong, L. An effective crack position diagnosis method for the hollow shaft rotor system based on the convolutional neural network and deep metric learning. Chin. J. Aeronaut. 35, 242–254 (2022).
Article Google Scholar
Lei, J., Liu, C. & Jiang, D. Fault diagnosis of wind turbine based on long short-term memory networks. Renew. Energy 133, 422–432 (2019).
Article Google Scholar
Yao, Y., Li, Y., Zhang, P., Xie, B. & Xia, L. Data fusion methods for convolutional neural network based on self-sensing motor drive system. In IECON 2018-44th Annual Conference of the IEEE Industrial Electronics Society, 5371–5376 (IEEE, 2018).
Deng, W., Nguyen, K. T., Medjaher, K., Gogu, C. & Morio, J. Rotor dynamics informed deep learning for detection, identification, and localization of shaft crack and unbalance defects. Adv. Eng. Inform. 58, 102128 (2023).
Article Google Scholar
Zhao, Z. et al. Deep learning algorithms for rotating machinery intelligent diagnosis: An open source benchmark study. ISA Trans. 107, 224–255 (2020).
Article PubMed Google Scholar
Zilke, J. R., Loza Mencía, E. & Janssen, F. Deepred—rule extraction from deep neural networks. In Discovery Science: 19th International Conference, DS 2016, Bari, Italy, October 19–21, 2016, Proceedings 19, 457–473 (Springer, 2016).
Grezmak, J., Zhang, J., Wang, P., Loparo, K. A. & Gao, R. X. Interpretable convolutional neural network through layer-wise relevance propagation for machine fault diagnosis. IEEE Sens. J. 20, 3172–3181 (2019).
Article ADS Google Scholar
Jia, F., Lei, Y., Lu, N. & Xing, S. Deep normalized convolutional neural network for imbalanced fault classification of machinery and its understanding via visualization. Mech. Syst. Signal Process. 110, 349–367 (2018).
Article ADS Google Scholar
Chen, H.-Y. & Lee, C.-H. Vibration signals analysis by explainable artificial intelligence (XAI) approach: Application on bearing faults diagnosis. IEEE Access 8, 134246–134256 (2020).
Article Google Scholar
Li, X., Zhang, W. & Ding, Q. Understanding and improving deep learning-based rolling bearing fault diagnosis with attention mechanism. Signal Process. 161, 136–154 (2019).
Article Google Scholar
Li, T. et al. Waveletkernelnet: An interpretable deep neural network for industrial intelligent diagnosis. IEEE Trans. Syst. Man Cybern. Syst. 52, 2302–2312 (2021).
Article Google Scholar
Su, H. et al. Rime: A physics-based optimization. Neurocomputing 532, 183–214 (2023).
Article Google Scholar
Du, P., Wang, J., Hao, Y., Niu, T. & Yang, W. A novel hybrid model based on multi-objective harris hawks optimization algorithm for daily pm2, 5 and pm10 forecasting. Appl. Soft Comput. 96, 106620 (2020).
Article Google Scholar
Dragomiretskiy, K. & Zosso, D. Variational mode decomposition. IEEE Trans. Signal Process. 62, 531–544 (2013).
Article ADS MathSciNet Google Scholar
Jin, Z., He, D. & Wei, Z. Intelligent fault diagnosis of train axle box bearing based on parameter optimization VMD and improved DBN. Eng. Appl. Artif. Intell. 110, 104713 (2022).
Article Google Scholar
Wang, X., Li, A. & Han, G. A deep-learning-based fault diagnosis method of industrial bearings using multi-source information. Appl. Sci. 13, 933 (2023).
Article CAS Google Scholar
Huo, D. et al. Gear fault diagnosis method based on multi-sensor information fusion and VGG. Entropy 24, 1618 (2022).
Article ADS PubMed PubMed Central Google Scholar
Peng, H. et al. Multi-sensor vibration signal based three-stage fault prediction for rotating mechanical equipment. Entropy 24, 164 (2022).
Article ADS MathSciNet PubMed PubMed Central Google Scholar
Liu, J., Zhang, Q., Xie, F., Wang, X. & Wu, S. Incipient fault detection of planetary gearbox under steady and varying condition. Expert Syst. Appl. 233, 121003 (2023).
Article Google Scholar
Wang, Q. & Wang, C. Incipient fault detection of nonlinear dynamical systems via deterministic learning. Neurocomputing 313, 125–134 (2018).
Article Google Scholar
Zhang, W., Li, C., Peng, G., Chen, Y. & Zhang, Z. A deep convolutional neural network with new training methods for bearing fault diagnosis under noisy environment and different working load. Mech. Syst. Signal Process. 100, 439–453 (2018).
Article ADS Google Scholar
Mao, B. et al. Denoising method based on VMD-PCC in $\varphi $-OTDR system. Opt. Fiber Technol. 74, 103081 (2022).
Article MathSciNet Google Scholar

Download references

Acknowledgements

This research was funded by the Science and Technology Research Project of Henan Province Grant Number 222102220007, and partly supported by the China Postdoctoral Science Foundation Grant Number 2023M730726.

Author information

Authors and Affiliations

College of Electrical Information Engineering, Zhengzhou University of Light Industry, Zhengzhou, 450000, China
Qian Wang & Shuo Hu
IoT Equipment Research Institute, GL TECH Co., Ltd., Zhengzhou, 450000, China
Qian Wang & Xinya Wang

Authors

Qian Wang
View author publications
You can also search for this author in PubMed Google Scholar
Shuo Hu
View author publications
You can also search for this author in PubMed Google Scholar
Xinya Wang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceptualization, Q.W. and S.H.; methodology, Q.W. and S.H.; software, Q.W. and S.H.; validation, Q.W. and S.H.; resources, Q.W.; data curation, Q.W.; writing-original draft preparation, S.H.; writing-review and editing, Q.W.; supervision, X.W.

Corresponding author

Correspondence to Qian Wang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wang, Q., Hu, S. & Wang, X. Detection of incipient rotor unbalance fault based on the RIME-VMD and modified-WKN. Sci Rep 14, 4683 (2024). https://doi.org/10.1038/s41598-024-54984-z

Download citation

Received: 03 January 2024
Accepted: 19 February 2024
Published: 26 February 2024
DOI: https://doi.org/10.1038/s41598-024-54984-z

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.