Introduction

Research status of bearing fault diagnosis

Bearings are typical components of rotating machinery, and their quality directly determines the machine's performance. Accurate and timely fault detection is the key technology for ensuring the reliability and safety of bearings. Because bearings are widely used across industries, the maintenance cost of a bearing fault is high and its consequences can be serious. Bearing fault diagnosis and condition monitoring are therefore very active research fields, and they are very helpful for early warning and fault location in rotating machinery. At present, online diagnosis and prediction of machine condition are used for fault early warning and maintenance.

With advances in sensor and signal processing technology, we can obtain higher-precision vibration, acoustic, current, voltage, temperature, and other signals. In recent years, research on bearing fault diagnosis has grown steadily, and work based on shallow learning has deepened1,2,3,4,5,6,7,8. To solve the problems of over-decomposition and information loss, Li et al.9 proposed an independence-oriented VMD to identify wheelset bearing faults in an orderly manner. For early fault prediction of bearings, Li et al.10 proposed adaptive multi-scale morphological analysis and bandwidth empirical mode decomposition. Zhang et al.11 proposed a bearing fault diagnosis method using variational mode decomposition; through analysis of the failure mechanism, they established a fault-signal calculation model for defects at different positions of the rolling bearing. To realize adaptive separation of the Fourier spectrum, Zheng et al.12 proposed an adaptive parameterless EWT method. Wu et al.13 used complete ensemble empirical mode decomposition with adaptive noise and the Hilbert-Huang transform to extract multiple degradation features, then selected monotonic, robust, fault-related degradation characteristics and merged them with the Mahalanobis-distance health index as the main component. Yan et al.14 proposed a fault classification algorithm based on an SVM optimized with multiple features, in which fault feature information is extracted from the time, frequency, and time-frequency domains by statistical analysis, FFT, and VMD, respectively.

There is also considerable research on bearing fault diagnosis models based on deep learning15,16,17,18,19,20,21,22,23. Xia et al.24 proposed a multi-sensor CNN model whose classification accuracy on the Case Western Reserve University bearing dataset reached 99.41%. Liu et al.25 proposed a dislocated time-series CNN model, which trains the CNN on dislocated time series of the original signal. Jiang et al.26 proposed a multi-scale CNN whose classification accuracy on the Case Western Reserve University bearing dataset reached 98.53%. Zhang et al.27 proposed a long short-term memory recurrent neural network model to evaluate bearing performance degradation. Zhao et al.28 developed a variant of the deep residual network that uses dynamically weighted wavelet coefficients to improve diagnostic performance; the network's input is a series of wavelet packet coefficient sets in different frequency bands. Wang et al.29 proposed a method that transforms the vibration information of multiple sensors into image information; it fuses the information and yields richer features than a single sensor's vibration signal. Yan et al. have done extensive research in this field; their models include a multiscale cascading deep belief network (MCDBN) for identifying the fault location of rotating machinery, a multi-domain indicator-based optimized stacked denoising autoencoder for automatic fault identification of rolling bearings, and a hybrid deep learning model for multistep forecasting of diurnal wind speed30,31,32.

There is also much research on transfer learning and small-sample model prediction33,34,35,36,37.

Problems of existing models

  1. Traditional feature extraction methods, such as empirical mode decomposition, wavelet transform, and fast Fourier transform, require a great deal of expert experience. Because of sensor noise, interference, shaft misalignment, and similar effects, an incipient fault is easily buried in clutter and difficult to see in the time-frequency spectrum, which makes early bearing faults hard to detect;

  2. End-to-end networks have strong nonlinear fitting ability but require a large amount of sample data. Unlike image classification tasks, the test conditions for industrial bearing datasets are demanding: they require high-precision vibration sensors, servo motors, a high-precision shaft system, and a stable data acquisition and control system, and it is difficult to obtain many samples of damaged bearings. Public datasets are therefore generally small-sample datasets. On such datasets the advantages of end-to-end networks are not obvious, and it is difficult for them to learn the sample distribution;

  3. Widely used autoencoder networks (AE), such as stacked autoencoders (SAE) and stacked denoising autoencoders (SDAE), can map data to lower dimensions; traditional classification algorithms such as SVM and random forests can then classify the low-dimensional features and obtain good results on small-sample datasets. But the prime objective of an autoencoder is to encode the data into a low-dimensional representation and then restore that representation to the original signal as faithfully as possible. It only attends to whether the restored signal differs from the original signal, so some information important for classification may be discarded as noise, which makes the classification results of algorithms such as SVM and random forests inaccurate.

Fault diagnosis using the triplet network and SVM

This paper mainly studies the problem of bearing fault diagnosis, where the damaged parts include the bearing's inner ring, balls, and outer ring. A vibration sensor collects the vibration of the shaft system, and we analyze the vibration signal changes caused by damage to different parts of the bearing. Because a bearing produces some vibration during high-speed rotation due to shaft deflection, machining error, and similar effects, early fault signals are difficult to identify. Moreover, because bearing failure datasets are generally small-sample datasets, it is difficult to obtain a large amount of data for faults at different locations. This further challenges the use of a single feature for bearing fault classification.

Basic SAE and SDAE

AE is a kind of artificial neural network that updates its weights through unsupervised learning to learn a mapping from the input data to a feature space. An AE consists of an encoder and a decoder. The encoder f maps the input data to a low-dimensional space through the function \(h = f\left( x \right)\), yielding a low-dimensional representation h of the original data. The decoder g maps h back to the input space through the function \({\hat{x}} = g\left( h \right)\), yielding the reconstruction \({\hat{x}}\) of x. Generally, the loss function of the network can be defined as:

$$\begin{aligned} L = \frac{1}{n}\sum \limits _{i = 1}^n {\left\| {{x_i} - Net({x_i})} \right\| } \end{aligned}$$
(1)

The loss function computes the Euclidean distance between x and \({\hat{x}}\) so as to reduce the reconstruction error and bring the reconstructed data closer to the original data. The encoder output h can be used as a low-dimensional representation of the input data. The encoder and decoder of an SAE are composed of multiple nested AEs, forming a structure with multiple hidden layers. Compared with a plain AE, an SAE can better learn deep features of the original data. The structure of the SAE is shown in Fig. 1.

Figure 1

Hierarchical framework of SAE.
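The reconstruction loss of Eq. (1) is straightforward to express in code. The following is a minimal NumPy sketch; the function and variable names are illustrative, not taken from the original implementation:

```python
import numpy as np

def reconstruction_loss(x, x_hat):
    """Mean Euclidean reconstruction error of Eq. (1).

    x     : (n, signal_length) batch of original signals
    x_hat : (n, signal_length) batch of reconstructions Net(x)
    """
    # ||x_i - Net(x_i)|| per sample, averaged over the batch
    return np.mean(np.linalg.norm(x - x_hat, axis=1))
```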

The basic idea of the DAE is to reconstruct the original input from a corrupted input, thereby obtaining a robust representation. This prevents the AE from merely learning an identity mapping between input and output and lets it capture more informative hidden patterns. In general, noise \(\sigma\) is added to the input x to obtain the corrupted signal \({\tilde{x}}\), which is fed to the DAE to produce the reconstruction \({\hat{x}}\). Formula (2) gives the loss function:

$$\begin{aligned} L = \frac{1}{n}\sum \limits _{i = 1}^n {\left\| {{x_i} - Net\left( {{x_i} + {\sigma _i}} \right) } \right\| } \end{aligned}$$
(2)

The network structure of the DAE is shown in Fig. 2.

Figure 2

Hierarchical framework of DAE.

SDAE stacks DAEs into a deep network to obtain deep features of the input data. The SDAE model corrupts the original data x into \({\tilde{x}}\) by adding Gaussian noise \(\sigma\), and the model is then trained to produce a better reconstructed signal.
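As a sketch, the corruption step and the denoising loss of Eq. (2) can be written as follows; `net` stands for any encoder-decoder model, and the noise standard deviation of 0.05 matches the SDAE setting used in the experiments below:

```python
import numpy as np

rng = np.random.default_rng(0)  # fixed seed, an arbitrary choice for this sketch

def corrupt(x, std=0.05):
    """Add zero-mean Gaussian noise (the sigma of Eq. (2)) to the clean input x."""
    return x + rng.normal(0.0, std, size=x.shape)

def dae_loss(x, net):
    """Denoising loss of Eq. (2): the network sees the corrupted input
    x_tilde but is penalized against the clean signal x."""
    x_tilde = corrupt(x)
    return np.mean(np.linalg.norm(x - net(x_tilde), axis=1))
```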

1DCNN based on triplet loss

We believe that on a small-sample bearing fault dataset, the CNN model needs sufficient depth to extract the deep information in the waveform. End-to-end learning on small-sample datasets is not stable enough, and it is difficult to learn a good distribution. To better classify bearing faults, this paper proposes an SVM fault classification method that uses a triplet network for data preprocessing. The triplet network converts the problem of learning a mapping from data to classes into the problem of learning the relationships between data, and this approach performs well on small-sample data. The established one-dimensional CNN model has 8 hidden layers, and its structure is shown in Fig. 3.

Figure 3

Hierarchical framework of 1DCNN.

A triplet network is built on the 1DCNN model. To train the triplet network, the data are grouped according to their labels:

  • a: anchor, any sample in the training set

  • p: positive, a sample of the same category as a

  • n: negative, a sample from a different category than a

The triplet network's output layer generally has few dimensions and serves as a high-order feature representation of the original signal. The same batch of data is fed into the model, and the triplet loss is calculated from the sample labels. The goal of the triplet loss is to reduce the distance between a and p and increase the distance between a and n in the embedding space. We introduce a margin so that the distance between samples of different categories exceeds the margin, which pushes the network to encode samples of different categories farther apart. The loss of a triplet (a, p, n) is:

$$\begin{aligned} loss = \max \left( {\left\| {a - p} \right\| - \left\| {a - n} \right\| + margin,0} \right) \end{aligned}$$
(3)
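For a single triplet, Eq. (3) amounts to a few lines of code. The sketch below assumes the embeddings are NumPy vectors; the default margin of 1 matches the value used later for the triplet network:

```python
import numpy as np

def triplet_loss(a, p, n, margin=1.0):
    """Triplet loss of Eq. (3) for one (a, p, n) embedding triple."""
    d_ap = np.linalg.norm(a - p)  # anchor-positive distance
    d_an = np.linalg.norm(a - n)  # anchor-negative distance
    return max(d_ap - d_an + margin, 0.0)
```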

In the formula, \(\left\| {a - p} \right\|\) is the Euclidean distance between sample a and sample p, and \(\left\| {a - n} \right\|\) is the Euclidean distance between sample a and sample n. The goal of the algorithm is to reduce \(\left\| {a - p} \right\|\) as much as possible, bringing a and p closer, and to increase \(\left\| {a - n} \right\|\) as much as possible, pushing a and n apart; that is, to make \(\left\| {a - n} \right\| - \left\| {a - p} \right\|\) large enough. By computing \(\max \left( {\left\| {a - p} \right\| - \left\| {a - n} \right\| + margin,0} \right)\), we control the aggregation of positive samples and the dispersion of negative samples in the feature space, so that the gap between positive and negative samples exceeds the margin as far as possible. There are three cases when calculating the triplet loss (a short code sketch of the three cases follows Figs. 4, 5, and 6):

  1. Easy triplets: \(loss=0\), when \(\left\| {a - p} \right\| + margin < \left\| {a - n} \right\|\). This case needs no optimization: a and p are already close, and a and n are farther apart than the a-p distance plus the margin, as shown in Fig. 4.

  2. Hard triplets: \(\left\| {a - n} \right\| < \left\| {a - p} \right\|\). The negative sample is closer to a than the positive sample is, and this case produces a large loss, as shown in Fig. 5.

  3. Semi-hard triplets: \(\left\| {a - p} \right\|< \left\| {a - n} \right\| < \left\| {a - p} \right\| + margin\). The distance between a and p is smaller than the distance between a and n, but the a-n distance is still less than the a-p distance plus the margin. This case produces a small loss, as shown in Fig. 6.

Figure 4

Easy triplets.

Figure 5

Hard triplets.

Figure 6

Semi-hard triplets.
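The three cases above can be expressed as a small helper function. This is an illustrative sketch; `d_ap` and `d_an` stand for the anchor-positive and anchor-negative distances:

```python
def triplet_category(d_ap, d_an, margin=1.0):
    """Classify a triplet into the three cases of Figs. 4-6."""
    if d_ap + margin < d_an:
        return "easy"       # loss = 0, nothing to optimize (Fig. 4)
    if d_an < d_ap:
        return "hard"       # negative closer than positive, large loss (Fig. 5)
    return "semi-hard"      # d_ap < d_an < d_ap + margin, small loss (Fig. 6)
```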

All training set data are fed into the network to compute the corresponding 64-dimensional embeddings. Take one sample a. There are m samples with the same label as a, forming the set \(P = \{ {p_1},{p_2},{p_3} \cdots {p_m}\}\), and n samples with labels different from a, forming the set \(N = \left\{ {{n_{\mathrm{{1}}}},{n_2},{n_3} \cdots {n_n}} \right\}\). Calculate the set \({D_p}\) of distances between a and all positive samples:

$$\begin{aligned} {D_p} = \left\{ {\left\| {a - {p_1}} \right\| ,\left\| {a - {p_2}} \right\| ,\left\| {a - {p_3}} \right\| , \ldots ,\left\| {a - {p_k}} \right\| , \ldots ,\left\| {a - {p_m}} \right\| } \right\} \end{aligned}$$
(4)

Calculate the distances between a and all negative samples to form the set \({D_n}\):

$$\begin{aligned} {D_n} = \left\{ {\left\| {a - {n_1}} \right\| ,\left\| {a - {n_2}} \right\| ,\left\| {a - {n_3}} \right\| , \ldots ,\left\| {a - {n_s}} \right\| , \ldots ,\left\| {a - {n_n}} \right\| } \right\} \end{aligned}$$
(5)

Taking one element from \({D_p}\) and one from \({D_n}\) and combining them with a forms a triplet. All possible combinations form the set T:

$$\begin{aligned} T = \left\{ {\left. d \right| d = max\left( {\left\| {a - {p_k}} \right\| - \left\| {a - {n_s}} \right\| + margin,0} \right) } \right\} \end{aligned}$$
(6)

There are \(m \times n\) combinations of elements in \({D_p}\) and \({D_n}\), the total number of samples is \(m + n + 1\), and a can take \(m + n + 1\) values, so the size of the set T is \(m \times n \times \left( {m + n + 1} \right)\). Selecting the semi-hard triplets in T gives the set \({T_s} = \left\{ {{d_1},{d_2},{d_3} \cdots {d_a}} \right\}\), and the triplet loss is calculated as:

$$\begin{aligned} loss = \frac{1}{a}\sum \limits _{i = 1}^a {{d_i}} \end{aligned}$$
(7)
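Putting Eqs. (4)-(7) together, semi-hard mining over a batch can be sketched as follows. The names are illustrative, and a production implementation would vectorize the loops:

```python
import numpy as np

def semi_hard_triplet_loss(embeddings, labels, margin=1.0):
    """Average loss over the semi-hard triplets T_s of a batch, Eqs. (4)-(7).

    embeddings : (N, 64) network outputs
    labels     : (N,) integer class labels
    """
    losses = []
    idx = np.arange(len(labels))
    for i, a in enumerate(embeddings):
        d = np.linalg.norm(embeddings - a, axis=1)   # distances from anchor a
        d_p = d[(labels == labels[i]) & (idx != i)]  # set D_p, Eq. (4)
        d_n = d[labels != labels[i]]                 # set D_n, Eq. (5)
        for d_ap in d_p:
            for d_an in d_n:
                if d_ap < d_an < d_ap + margin:      # semi-hard condition
                    losses.append(d_ap - d_an + margin)
    return float(np.mean(losses)) if losses else 0.0
```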
Figure 7

Training flow of triplet network and SVM.

Figure 8

Block diagram of the proposed model.

Figure 7 shows the training process of the triplet network and SVM, and Fig. 8 is a block diagram of the proposed method. First, the labeled vibration data are input to the model. Then the triplet network's parameters are initialized, and the collected bearing dataset is fed into the triplet network for training. The model's loss function is the triplet loss, and its output is a 64-dimensional vector. The trained model maps samples into the high-dimensional feature space so that the distance between similar samples is small and the distance between different samples is large. Finally, an SVM classifies the high-dimensional features, from which we can judge whether the bearing is faulty and locate the fault.
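A sketch of the final classification stage is shown below. `triplet_net` stands for the trained embedding model, and the train/test arrays are assumed to be prepared; the linear kernel and penalty coefficient C=10 follow the SVM settings given in the experiments:

```python
from sklearn.svm import SVC

# map raw vibration samples to 64-dimensional high-order features
emb_train = triplet_net.predict(x_train)
emb_test = triplet_net.predict(x_test)

# linear-kernel SVM with penalty coefficient 10, trained in the feature space
clf = SVC(kernel="linear", C=10)
clf.fit(emb_train, y_train)
print("test accuracy:", clf.score(emb_test, y_test))
```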

Unlike an AE, the triplet loss computes differences between same-class and different-class samples and backpropagates to update the network weights, so that same-class samples move closer together and different-class samples move farther apart.

Validation of proposed method

Data Castle bearings dataset

Dataset introduction

This paper uses the bearing dataset provided by Data Castle to verify the method38. Data Castle collected vibration information from normal and faulty bearings. There are three types of bearing fault, involving the inner race, outer race, and balls, and each fault type includes three different diameters, so there are nine different bearing fault labels. The dataset we used is shown in Table 1.

Table 1 Data Castle bearings dataset.

To verify the robustness of the algorithm, we add white noise with mean 0 and standard deviation 0.2 to the original data and train the model on the noisy data, testing whether the model can accurately judge samples with inconspicuous features. The data after adding noise are shown in Fig. 9.

Figure 9

Fault bearing vibration data waveform.

After adding noise, the data's characteristics become less obvious, and the similarity between same-class features decreases, which requires the model to be robust enough to handle these changes.
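The noise injection used for this robustness test can be sketched in a few lines; `raw_data` is a placeholder for the original vibration array:

```python
import numpy as np

rng = np.random.default_rng(42)  # arbitrary seed for reproducibility

# white Gaussian noise, mean 0 and standard deviation 0.2, per the setup above
noisy_data = raw_data + rng.normal(0.0, 0.2, size=raw_data.shape)
```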

Experiment

Experimental steps:

  1. Process the bearing vibration data and attach appropriate labels.

  2. Add white noise to the vibration data to make its characteristics less obvious.

  3. Divide the collected raw vibration data into a training dataset and a test dataset, and train the network.

  4. Input all training samples into the model, compute their high-order features, and use these features to train the SVM classifier.

  5. Input the test samples into the model to compute their high-order features, then feed those features into the trained SVM classifier for classification to verify the effectiveness of the proposed method.

To verify the feasibility and superiority of the method, standard SAE, SDAE, and CNN models were used for comparison. All of the following experiments were implemented in Python.

  • The key parameters of SAE: the encoder is a 6-layer network containing three convolutional layers and three down-sampling layers that maps the original vibration data to a 64-dimensional vector; the decoder contains three deconvolution layers that restore the 64-dimensional vector to vibration data. The 64-dimensional vector output by the encoder is fed to an SVM classifier with a Gaussian kernel, a penalty coefficient of 1, and a gamma value of 1/64.

  • The key parameters of SDAE: Gaussian noise with mean 0 and standard deviation 0.05 is added to the original data. The other parameters of SDAE are the same as SAE.

  • The key parameters of the CNN model: the end-to-end CNN's first layer is a one-dimensional convolutional layer with 32 kernels; the second is a down-sampling layer; the third is a one-dimensional convolutional layer with 64 kernels; the fourth is a down-sampling layer; the fifth is a one-dimensional convolutional layer with 128 kernels; the sixth is a global average pooling layer; and the seventh is a fully connected layer with 64 neurons. The output layer's activation function is softmax, the optimizer is RMSprop, the learning rate is 0.0003, and the loss function is sparse categorical cross-entropy.

  • The triplet network's first layer is a convolutional layer with 32 kernels; the second is a down-sampling layer; the third is a convolutional layer with 64 kernels; the fourth is a down-sampling layer; the fifth is a convolutional layer with 128 kernels; the sixth is a global average pooling layer; the seventh is a fully connected layer with 64 neurons; and the eighth is an L2 normalization layer. The margin is 1, the optimizer is RMSprop, and the learning rate is 0.0003. The trained triplet network provides the feature mapping, and an SVM classifier with a linear kernel and a penalty coefficient of 10 performs classification in the feature space (a code sketch follows this list).
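A minimal Keras sketch of the triplet branch described above is given below. The layer types, channel counts, embedding size, margin, optimizer, and learning rate follow the parameters listed; the kernel sizes, pool sizes, and input length are illustrative assumptions, and the semi-hard triplet loss is taken from TensorFlow Addons rather than from the authors' code:

```python
import tensorflow as tf
import tensorflow_addons as tfa  # provides a semi-hard triplet loss
from tensorflow.keras import layers, models

def build_triplet_branch(input_len=2048, channels=1):
    """1DCNN embedding branch: conv32 -> pool -> conv64 -> pool -> conv128
    -> global average pooling -> dense64 -> L2 normalization."""
    return models.Sequential([
        layers.Input(shape=(input_len, channels)),
        layers.Conv1D(32, 9, padding="same", activation="relu"),
        layers.MaxPooling1D(2),                  # down-sampling layer
        layers.Conv1D(64, 9, padding="same", activation="relu"),
        layers.MaxPooling1D(2),                  # down-sampling layer
        layers.Conv1D(128, 9, padding="same", activation="relu"),
        layers.GlobalAveragePooling1D(),
        layers.Dense(64),                        # 64-dimensional embedding
        # L2 normalization layer: project embeddings onto the unit hypersphere
        layers.Lambda(lambda t: tf.math.l2_normalize(t, axis=1)),
    ])

model = build_triplet_branch()
model.compile(optimizer=tf.keras.optimizers.RMSprop(learning_rate=3e-4),
              loss=tfa.losses.TripletSemiHardLoss(margin=1.0))
# model.fit(x_train, y_train, ...) then trains on (sample, label) pairs
```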

Table 2 lists the average test accuracy of each model. Analysis of the training results shows that the classification accuracy of triplet network+SVM reaches 96.31%, much higher than SAE+SVM and SDAE+SVM. Lacking inter-sample references, the traditional encoder networks only attend to the difference between the decoded signal and the original signal and may discard subtle features that carry high weight in classification. The end-to-end CNN is not as good as triplet network+SVM because of the small sample size.

Table 2 Test accuracy of each model.

As shown in Fig. 10, the confusion matrices show that the signals misclassified by SAE+SVM and SDAE+SVM are similar to their correct classes, which means the autoencoder-compressed signal loses some information needed for classification, lowering the SVM classifier's accuracy. The classification accuracy of the end-to-end CNN model reaches 89.01%, but with the insufficient sample size it is difficult to improve the model's predictive ability further. Trained on the small-sample dataset, the triplet network learned to distinguish samples with different labels, making same-class samples closer and different-class samples farther apart in the feature space.

Figure 10

Multi-class confusion matrix of the proposed method: (a) SAE+SVM; (b) SDAE+SVM; (c) CNN; (d) Triplet network+SVM.

To compare the feature extraction capabilities of the methods, PCA dimensionality reduction was performed on the outputs of the three models, and the first two principal components were selected to visualize the features.

Figure 11

PCA dimension reduction feature visualization: (a) SAE; (b) SDAE; (c) Triplet network.

Figure 11 shows that after dimensionality reduction the different data classes produced by the triplet network are far apart in the feature space; the network has learned the differences between the data. By contrast, the high-dimensional features learned by SAE and SDAE do not capture the differences between data classes well.

The Calinski-Harabasz index (CH) is used to evaluate the separability of the feature spaces. Within-class tightness is measured by the sum of squared distances between each point in a class and the class center, and between-class separation is measured by the sum of squared distances between the class centers. The index is:

$$\begin{aligned} CH(K) = \frac{BGSS/(K - 1)}{WGSS/(n - K)} \end{aligned}$$
(8)

where n is the number of samples in the dataset and K is the number of categories. WGSS and BGSS are given by Formulas (9) and (10):

$$\begin{aligned} WGSS = \frac{1}{2}\left[ ({n_1} - 1){\overline{d}} _1^2 + \cdots + ({n_K} - 1){\overline{d}} _K^2\right] \end{aligned}$$
(9)
$$\begin{aligned} BGSS = \frac{1}{2}\left[ (K - 1){{\overline{d}} ^2} + (n - K){A_K}\right] \end{aligned}$$
(10)
$$\begin{aligned} {A_K} = \frac{1}{{n - K}}\sum \limits _{i = 1}^K {({n_i} - 1)({{{\overline{d}} }^2} - {\overline{d}} _i^2)} \end{aligned}$$
(11)

\({\overline{d}} _j^2\) is the average distance between samples of class j, \(j = 1,2, \ldots ,K\), and \({{\overline{d}} ^2}\) is the average distance between all samples.

The larger the CH index, the smaller the intra-class distance and the larger the inter-class distance, so the classes are more compact, better separated, and easier to classify. The smaller the CH index, the larger the intra-class distance and the smaller the inter-class distance, making classification difficult. The CH index values of SAE, SDAE, and the triplet network are shown in Table 3.

Table 3 CH index of each algorithm.
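The CH index does not need to be computed by hand: scikit-learn ships an implementation. A short sketch, where `features` stands for the learned 64-dimensional embeddings and `labels` for the fault classes:

```python
from sklearn.metrics import calinski_harabasz_score

# larger values indicate tighter classes that are farther apart
ch = calinski_harabasz_score(features, labels)
print(f"CH index: {ch:.2f}")
```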

XJTU-SY bearing dataset

Dataset introduction

The bearing testbed consists of a supporting shaft, a motor speed controller, an AC induction motor, a hydraulic loading system, and two supporting bearings, and is designed for accelerated degradation tests of bearings under different working conditions. The hydraulic loading system exerts a radial force on the housing of the tested bearing, and the AC induction motor's speed controller sets and maintains the speed of the whole shaft system39, as shown in Fig. 12.

Figure 12

Testbed of rolling element bearings.

The acceleration sensors used in the experiment are PCB 352C33 units mounted on magnetic bases fixed in the vertical and horizontal directions of the test bearing. The dynamic signal collector is a DT 9837. The sampling frequency is 25.6 kHz, the sampling interval is 1 min, and each sample lasts 1.28 s. Table 4 shows the detailed information for each tested bearing, including its operating conditions, bearing lifetime, and failure location.

Table 4 XJTU-SY bearing dataset.

Figure 13 shows the failed bearings. Bearing failure is mainly caused by outer race wear, inner race wear, outer race fracture, and cage fracture, and a bearing may suffer two or more types of compound damage. Compound damage carries the waveform characteristics of multiple damage locations, which increases the difficulty of classification; the approach adopted in this paper is to assign separate labels to compound damage.

Figure 13

Failure bearings: (a) inner race wear; (b) cage fracture; (c) outer race wear; (d) outer race fracture.

Experiment

The XJTU-SY bearing dataset contains the full-lifecycle vibration signals of 15 bearings. To divide the dataset, we take data with amplitude less than 3 g as the vibration data of normal bearings and data with amplitude greater than 5 g as the vibration data of faulty bearings. Since the signals are collected in both the vertical and horizontal directions, the two channels of vertical and horizontal vibration signals serve as the model's input.
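The amplitude rule used to split the lifecycle recordings can be sketched as follows (illustrative names; under this reading of the rule, windows between 3 g and 5 g are discarded):

```python
import numpy as np

def label_by_amplitude(windows, low=3.0, high=5.0):
    """Label XJTU-SY windows: peak < 3 g -> normal (0), peak > 5 g -> faulty (1).

    windows : (N, length, 2) array, horizontal and vertical channels.
    """
    peak = np.abs(windows).max(axis=(1, 2))  # peak amplitude per window, in g
    keep = (peak < low) | (peak > high)      # drop ambiguous windows
    return windows[keep], (peak[keep] > high).astype(int)
```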

To verify the robustness of the four models, we add white noise with mean 0 and standard deviation 2 to the original data, as shown in Fig. 14, in which the blue curve is the horizontal vibration data and the yellow curve is the vertical vibration data.

Figure 14

Fault bearing vibration data waveform.

The SAE, SDAE, CNN, and triplet network models were trained, and the test dataset was input into each model. The results are shown in Table 5.

Table 5 Diagnosis result of each algorithm.

The results of triplet network+SVM on the XJTU-SY bearing dataset are better than those of CNN, SDAE+SVM, and SAE+SVM, with a classification accuracy of 97.09%. The autoencoder-based methods also outperform the end-to-end CNN.

Conclusion

We propose a bearing fault classification method based on a triplet network and SVM. The proposed method consists of two main steps. First, we propose a one-dimensional convolution model based on the triplet loss; the model's input is the original bearing vibration data, and its output is a 64-dimensional high-order feature. The loss function computes the L2 distance between samples in the high-order feature space, which brings samples of the same category closer together and pushes samples of different categories apart. Second, an SVM classifies the samples in the high-order feature space. Two examples illustrate the superiority of this method for small-sample classification problems. The triplet network based on the CNN model extracts high-order features of the one-dimensional vibration signal that express the differences between signals well. Compared with CNN, SDAE+SVM, and SAE+SVM, the algorithm performs better on small-sample classification problems and can accurately determine the fault category of the bearing.

Although existing algorithms have studied the fault classification of rotating machinery in depth, systematic research on early fault warning is still lacking. Next, we will study early fault diagnosis and fault location identification of bearings, so as to predict possible bearing faults earlier; early warning can avoid the equipment damage caused by bearing failure.
