Introduction

Multivariate statistical-based process monitoring (MSPM) methods1,2,3,4, e.g., principal component analysis (PCA)5, 6, partial least squares (PLS)7, 8, and canonical correlation analysis (CCA)9, 10, are effective data-driven approaches for monitoring large-scale industrial processes. The main idea of MSPM is to analyze the correlations between process variables and to extract feature components for the construction of statistical indices.

MSPM has been a research hotspot for many years, and a large number of relevant studies are published each year. In recent years, studies have focused on improving existing methods to deal with process characteristics such as nonlinear, non-Gaussian, and dynamic features. For example, Ge et al.11 combined the multivariate linear Gaussian state-space model with MSPM to handle dynamic features; Du et al.12 proposed a Gaussian distribution transformation (GDT)-based monitoring method to handle non-Gaussian features; and Lou et al.13 combined artificial neural networks with PCA and proposed a new neural component analysis for handling nonlinear features. Meanwhile, Zhou et al.14 proposed a nonlinear key performance indicator (KPI) strategy for the PLS algorithm.

Because MSPM can compress high-dimensional data into two or three statistical indices, it is a convenient tool for detecting abnormal conditions over the whole process. To address the fault localization problem, the contribution plot method15, 16 was proposed for MSPM, which calculates the contribution of each variable of the original data set and picks the variables with high contributions as fault sources. Most studies on MSPM use the contribution plot as a basic algorithm tool17, 18, and a few studies have proposed improved versions of MSPM that cannot use the traditional contribution plot directly (examples include kernel PCA19 and robust PCA20).

However, according to actual simulation test results, MSPM is insensitive to specific faults, and the contribution plot method may mistakenly diagnose normal variables as a fault source. The reason for this phenomenon is that the traditional MSPM methods are based on the correlations between all process variables, and some correlations can be deduced from others, which means that these correlations are redundant. As such, the feature components extracted by traditional MSPM methods contain information from many process variables and hence are also disturbed by noise from these variables; therefore, traditional MSPM methods are insensitive to specific faults. In addition, the redundant correlations may mislead the contribution plot method, which results in incorrect localization of faults.

To handle these problems, multiblock MSPM methods, such as consensus PCA (CPCA)21, multiblock PLS (MBPLS)18, and hierarchical PLS (HPLS)22, have been proposed to reduce the number of variables and improve the interpretability of multivariate models. The main idea of multiblock MSPM methods is to divide the process variables into several blocks and combine the monitoring result of each block. However, block division is still an open problem in academic and engineering fields. Although Slama gave a general guideline "blocks should correspond as closely as possible to distinct units of the process where all the variables within a block or process unit may be highly coupled, but where there is minimal coupling among variables in different blocks"18, this rule is inappropriate for large-scale industrial processes, because (a) in large-scale industrial processes, variables in different process units are still highly coupled; (b) variables in the same unit may be unrelated. In addition, in multiblock MSPM methods each variable belongs to only one block, so the remaining blocks may lose key input variables, which causes large model errors. For example, for the model in Fig. 1, it is hard to divide the process variables into two or more blocks: when \(x_{3}\) is allocated to block 2, block 1 loses the information of \(x_{3}\). Besides, it is difficult to divide the blocks with traditional data-driven methods, and hence many multiblock MSPM methods require process prior knowledge for block division23.

Figure 1

The traditional multivariate statistical-based process monitoring (MSPM) methods, multiblock MSPM, and minimalist module analysis.

To eliminate the influence of the redundant correlations among process data, this paper proposes a novel MSPM method called minimalist module analysis (MMA). All variables in a minimalist module are strongly correlated, and no redundant variables exist. As shown in Fig. 1, MMA analyzes only the correlations between variables in the same module, and hence the extracted feature components are not disturbed by noise from other modules. In addition, the modularization analysis results can provide more useful information for fault localization.

The differences between MMA and the multiblock MSPM methods are as follows: first, in MMA, each variable may belong to more than one module (\(x_{1}\) belongs to two modules in Fig. 1), so each module represents one complete correlation without information loss; second, in MMA, module division is based on statistical analysis rather than process prior knowledge, which is consistent with the data-driven nature of MSPM; third, each module in MMA contains only one correlation, whereas each block in the multiblock MSPM methods may contain more than one correlation.

The main innovations of this study are as follows. First, we propose a modularization method based on singular value decomposition (SVD)24 and particle swarm optimization (PSO)25, which can divide the process variables into different minimalist modules and an independent module. Then, we propose new monitoring indices for each module. In addition, we propose a new fault localization strategy for MMA.

According to a survey paper1, PCA is the most commonly used MSPM method. As such, this paper focuses on the comparison of MMA and PCA; our conclusion is also applicable to other algorithms, such as PLS and CCA. The simulation tests in a mathematical model and the Tennessee Eastman (TE) process26 show that MMA can successfully obtain the minimalist modules; moreover, it achieves much better performance than the traditional MSPM methods in fault detection and fault localization.

The remainder of this paper is organized as follows. In “Methods” section, we briefly review some concepts of classical PCA and the contribution plot method, and assess the defects of these methods. “Minimalist module analysis (MMA)” section then proposes MMA for process monitoring, and introduces some details. “Simulation study of MMA” section analyzes the characteristics of MMA, and compares this method with PCA by conducting tests on a mathematical model. “Fault detection in the Tennessee Eastman process” section compares MMA with other improved MSPM methods in the TE process. Lastly, “Conclusions” section summarizes the contributions of this paper, and discusses some directions for future studies.

Methods

Principal component analysis (PCA)

PCA decomposes the data matrix \({\mathbf{X}} \in {\mathbf{R}}^{n \times s}\) (where n is the number of samples, and s is the number of variables) into a transformed k subspace of reduced dimensions as follows:

$${\mathbf{X}} = {\mathbf{TP}}^{{\mathbf{T}}} + {\mathbf{E}} = \hat{\mathbf{X}} + {\mathbf{E}},$$
(1)

where \({\mathbf{T}} \in {\mathbf{R}}^{n \times k}\) refers to the score matrix, whose columns are orthogonal; \({\mathbf{P}} \in {\mathbf{R}}^{s \times k}\) refers to the loading matrix, whose columns are orthonormal; and \({\mathbf{E}} \in {\mathbf{R}}^{n \times s}\) is the residual matrix. To obtain the loading matrix \({\mathbf{P}}\), one should first calculate the covariance matrix:

$${{\varvec{\Xi}}} = \frac{1}{n - 1}{\mathbf{X}}^{T} {\mathbf{X}}$$
(2)

Then, \({{\varvec{\Xi}}}\) can be decomposed by singular value decomposition (SVD) as follows:

$${{\varvec{\Xi}}} = {\mathbf{P}}_{0}^{T} {\mathbf{\Lambda P}}_{0} ,$$
(3)

where \({\mathbf{\Lambda = }}\left[ {\begin{array}{*{20}l} {\lambda_{1} } \hfill & 0 \hfill & 0 \hfill & 0 \hfill \\ 0 \hfill & {\lambda_{2} } \hfill & 0 \hfill & 0 \hfill \\ 0 \hfill & 0 \hfill & \ddots \hfill & 0 \hfill \\ 0 \hfill & 0 \hfill & 0 \hfill & {\lambda_{s} } \hfill \\ \end{array} } \right]\) (\(\lambda_{1} \ge \lambda_{2} \ge \cdots \ge \lambda_{s} \ge 0\)) is a diagonal matrix. Matrix \({\mathbf{P}}\) consists of the columns of P0 associated with the k largest eigenvalues, and k is determined by the cumulative percent variance (CPV)27 as follows:

$$CPV = \frac{{\sum\nolimits_{i = 1}^{k} {\lambda_{i} } }}{{\sum\nolimits_{i = 1}^{s} {\lambda_{i} } }} \times 100\% \ge \varepsilon ,$$
(4)

where \(\varepsilon\) is a parameter usually set to 85%. When CPV is larger than \(\varepsilon\), we take k as the number of the principal components (PCs).

Then, two statistics are constructed to monitor the new process data sample \({\mathbf{x}} \in {\mathbf{R}}^{1 \times s}\) as follows:

$$\left\{ \begin{gathered} T^{2} = {\mathbf{xP}}({{\varvec{\Lambda}}}_{k} )^{ - 1} {\mathbf{P}}^{T} {\mathbf{x}}^{T} \hfill \\ SPE = ({\mathbf{x}} - {\hat{\mathbf{x}}})({\mathbf{x}} - {\hat{\mathbf{x}}})^{T} \hfill \\ \end{gathered} \right.,$$
(5)

where \({\hat{\mathbf{x}}} = {\mathbf{TP}}^{T} = {\mathbf{xPP}}^{T}\) and \({{\varvec{\Lambda}}}_{k} { = }\left[ {\begin{array}{*{20}c} {\lambda_{1} } & 0 & 0 & 0 \\ 0 & {\lambda_{2} } & 0 & 0 \\ 0 & 0 & \ddots & 0 \\ 0 & 0 & 0 & {\lambda_{k} } \\ \end{array} } \right]\) \(\left( {\lambda_{1} \ge \lambda_{2} \ge \cdots \ge \lambda_{k} \ge 0} \right)\). The thresholds for the two indices, \(\delta_{{T^{2} }}\) and \(\delta_{SPE}\), can be found in reference28.
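To make the procedure concrete, the following Python sketch implements the offline modelling of Eqs. (1)–(4) and the online indices of Eq. (5). The threshold formulas used here (an F-distribution limit for \(T^{2}\) and the Jackson–Mudholkar approximation for SPE) are one common choice from the literature and should be read as assumptions of this sketch; the thresholds actually adopted in this paper follow reference28. Function and variable names are illustrative only.

```python
import numpy as np
from scipy import stats

def pca_fit(X, cpv=0.85, alpha=0.01):
    """Offline PCA model from standardized training data X (n x s), Eqs. (1)-(4)."""
    n, s = X.shape
    cov = X.T @ X / (n - 1)                                # Eq. (2)
    eigval, eigvec = np.linalg.eigh(cov)
    order = np.argsort(eigval)[::-1]                       # sort eigenvalues descending
    eigval, eigvec = eigval[order], eigvec[:, order]
    k = int(np.searchsorted(np.cumsum(eigval) / eigval.sum(), cpv)) + 1   # Eq. (4)
    P, lam = eigvec[:, :k], eigval[:k]
    # T^2 limit via the F distribution (assumes n > k).
    t2_lim = k * (n - 1) * (n + 1) / (n * (n - k)) * stats.f.ppf(1 - alpha, k, n - k)
    # SPE limit via the Jackson-Mudholkar approximation (assumes k < s).
    th1, th2, th3 = (np.sum(eigval[k:] ** i) for i in (1, 2, 3))
    h0 = 1 - 2 * th1 * th3 / (3 * th2 ** 2)
    c = stats.norm.ppf(1 - alpha)
    spe_lim = th1 * (c * np.sqrt(2 * th2 * h0 ** 2) / th1
                     + 1 + th2 * h0 * (h0 - 1) / th1 ** 2) ** (1 / h0)
    return P, lam, t2_lim, spe_lim

def pca_monitor(x, P, lam):
    """T^2 and SPE of Eq. (5) for one standardized sample x of shape (s,)."""
    t = x @ P                                              # scores
    t2 = np.sum(t ** 2 / lam)
    r = x - t @ P.T                                        # residual x - x_hat
    return t2, r @ r
```

A sample is flagged as faulty when its \(T^{2}\) or SPE value exceeds the corresponding limit.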

Contribution plot

The contributions to SPE are calculated as follows:

$$Con SPE_{j} = ({\mathbf{x}}_{j} - {\hat{\mathbf{x}}}_{j} )({\mathbf{x}}_{j} - {\hat{\mathbf{x}}}_{j} )^{T} ,$$
(6)

where \({\mathbf{x}}_{j}\) and \({\hat{\mathbf{x}}}_{j}\) are the jth columns of x and \({\hat{\mathbf{x}}}\), respectively. The contributions to \(T^{2}\) are calculated as follows:

$$ConT_{j}^{2} = \sum\limits_{i = 1}^{k} {\left( {{\mathbf{x}}_{j} - {\hat{\mathbf{x}}}_{j} } \right){\mathbf{P}}_{j,i} \lambda_{i}^{ - 1} } {\mathbf{P}}_{i}^{T} {\mathbf{x}}^{T} ,$$
(7)

where Pi is the ith column of P, and Pj,i is the element in the jth row and ith column.

The role of the contribution plots in fault isolation is to indicate which variables are related to the fault rather than to reveal its actual cause. In general, variables with a higher contribution have a closer relationship with the fault source. The thresholds of \(Con SPE_{j}\) and \(ConT_{j}^{2}\) can be obtained by kernel density estimation29.
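For illustration, a short Python sketch of Eqs. (6) and (7) is given below; it reuses the P and lam arrays returned by the hypothetical pca_fit function sketched above, and the names are again illustrative.

```python
import numpy as np

def contributions(x, P, lam):
    """Variable-wise contributions of Eqs. (6) and (7) for one standardized sample x."""
    x_hat = x @ P @ P.T                          # reconstruction of Eq. (5)
    con_spe = (x - x_hat) ** 2                   # Eq. (6)
    scores_scaled = (P.T @ x) / lam              # (P_i^T x^T) / lambda_i for each PC
    con_t2 = (x - x_hat) * (P @ scores_scaled)   # Eq. (7)
    return con_spe, con_t2
```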

Drawback of PCA and contribution plot method

Theorem

The redundant variables introduce extra noise into the principal components (PCs).

Proof

Assume \({\mathbf{X}}_{1} \in {\mathbf{R}}^{n \times s}\) are the variables belonging to a minimalist module, which can be full-rank decomposed as

$${\mathbf{X}}_{1} = {\mathbf{T}}_{0} {\mathbf{P}}_{0}^{T} ,$$
(8)

where \({\mathbf{T}}_{0} \in {\mathbf{R}}^{n \times s}\) and \({\mathbf{P}}_{0} \in {\mathbf{R}}^{s \times s}\). Matrix \({\mathbf{X}}_{2} \in {\mathbf{R}}^{{n \times s^{\prime}}}\) contains the redundant variables, which can be presented as a linear combination of X1 as follows:

$${\mathbf{X}}_{2} = {\mathbf{X}}_{1} {\mathbf{R}} + {\mathbf{W}},$$
(9)

where \({\mathbf{R}} \in {\mathbf{R}}^{{s \times s{\prime }}}\) is the linear transformation matrix, and \({\mathbf{W}} \in {\mathbf{R}}^{{n \times s{\prime }}}\) is the noise belonging to X2. In this paper, we assume that each measurement variable contains independent sensor noise, and hence, rank(W) = s′.

Taking \({\mathbf{X}} = \left[ {\begin{array}{*{20}c} {{\mathbf{X}}_{1} } & {{\mathbf{X}}_{2} } \\ \end{array} } \right]\), one obtains

$${\mathbf{X}} = {\mathbf{T}}_{0} {\mathbf{P}}_{0}^{T} \left[ {\begin{array}{*{20}c} {\mathbf{I}} & {\mathbf{R}} \\ \end{array} } \right] + \left[ {\begin{array}{*{20}c} {\mathbf{0}} & {\mathbf{W}} \\ \end{array} } \right].$$
(10)

Part \({\mathbf{T}}_{0} {\mathbf{P}}_{0}^{T} \left[ {\begin{array}{*{20}l} {\mathbf{I}} \hfill & {\mathbf{R}} \hfill \\ \end{array} } \right]\) can be decomposed by a full-rank singular value decomposition as

$${\mathbf{T}}_{0} {\mathbf{P}}_{0}^{T} \left[ {\begin{array}{*{20}c} {\mathbf{I}} & {\mathbf{R}} \\ \end{array} } \right] = {\mathbf{T}}_{1} {\mathbf{P}}_{1}^{T} ,$$
(11)

where \({\mathbf{T}}_{1} \in {\mathbf{R}}^{{n \times \left( {s + s^{\prime}} \right)}}\), rank(T1) = rank(T0), and \({\mathbf{P}}_{1} \in {\mathbf{R}}^{{(s + s{\prime }) \times \left( {s + s^{\prime}} \right)}}\). Hence, one obtains

$${\mathbf{X}} = {\mathbf{T}}_{1} {\mathbf{P}}_{1}^{T} + \left[ {\begin{array}{*{20}c} {\mathbf{0}} & {\mathbf{W}} \\ \end{array} } \right]{ = }\left( {{\mathbf{T}}_{1} { + }\left[ {\begin{array}{*{20}c} {\mathbf{0}} & {\mathbf{W}} \\ \end{array} } \right]{\mathbf{P}}_{1} } \right){\mathbf{P}}_{1}^{T} .$$
(12)

Taking \({\mathbf{P}}_{1} = \left[ {\begin{array}{*{20}c} {{\mathbf{P^{\prime}}}_{1} \in {\mathbf{R}}^{{s \times \left( {s + s^{\prime}} \right)}} } \\ {{\mathbf{P^{\prime\prime}}}_{1} \in {\mathbf{R}}^{{s^{\prime} \times \left( {s + s^{\prime}} \right)}} } \\ \end{array} } \right]\),

$${\mathbf{X}} = \left( {{\mathbf{T}}_{1} { + }{\mathbf{WP^{\prime\prime}}}_{1} } \right){\mathbf{P}}_{1}^{T} .$$
(13)

Because part \(\left( {{\mathbf{T}}_{1} { + }{\mathbf{WP}}_{1}^{{\prime \prime }} } \right)\) is non-orthogonal in most situations, we introduce another orthonormal matrix \({\mathbf{Q}} \in {\mathbf{R}}^{{\left( {s + s^{\prime}} \right) \times \left( {s + s^{\prime}} \right)}}\), which makes

$$\left\{ {\begin{array}{*{20}l} {{\mathbf{X}} = {\mathbf{T}}_{2} {\mathbf{P}}_{2}^{T} } \hfill \\ {{\mathbf{T}}_{2} {\text{ = }}\left( {{\mathbf{T}}_{1} {\text{ + }}{\mathbf{WP}}_{1}^{{\prime \prime }} } \right){\mathbf{Q}}} \hfill \\ {{\mathbf{P}}_{2} {\text{ = }}{\mathbf{P}}_{1} {\mathbf{Q}}} \hfill \\ \end{array} } \right.$$
(14)

It should be noted that when \(\left( {{\mathbf{T}}_{1} { + }{\mathbf{WP}}_{1}^{{\prime \prime }} } \right)\) is orthogonal, then Q = I.

PCA picks the k components of T2 associated with the k largest eigenvalues as PCs, and we denote them as \({\mathbf{T}}_{k} \in {\mathbf{R}}^{n \times k}\). Then,

$${\mathbf{T}}_{k} { = }\left( {{\mathbf{T}}_{1} { + }{\mathbf{WP}}_{1}^{{\prime \prime }} } \right){\mathbf{Q}}_{k} = {\mathbf{T}}_{1} {\mathbf{Q}}_{k} + {\mathbf{WP}}_{1}^{{\prime \prime }} {\mathbf{Q}}_{k} ,$$
(15)

where \({\mathbf{Q}}_{k} \in {\mathbf{R}}^{{(s + s^{\prime}) \times k}}\) consists of the corresponding k columns of \({\mathbf{Q}}\). Taking \({{\varvec{\Pi}}} = {\mathbf{P}}_{1}^{{\prime \prime }} {\mathbf{Q}}_{k} \in {\mathbf{R}}^{{s^{\prime} \times k}}\), and because \({\mathbf{P}}_{1}^{{\prime \prime }}\) and \({\mathbf{Q}}_{k}\) are parts of the orthonormal matrices \({\mathbf{P}}_{1}\) and \({\mathbf{Q}}\), one obtains \({{\varvec{\Pi}}} \ne {\mathbf{0}}\) (\(rank\left( {{\varvec{\Pi}}} \right) \ne 0\)), except in the exceptionally rare situation that all columns of \({\mathbf{Q}}_{k}\) belong to the column space of \({\mathbf{P^{\prime}}}_{1}^{T}\). Since \(rank\left( {\mathbf{W}} \right) + rank\left( {{\varvec{\Pi}}} \right) > s^{\prime}\), Sylvester's rank inequality gives \(rank\left( {{\mathbf{W\Pi }}} \right) \ge rank\left( {\mathbf{W}} \right) + rank\left( {{\varvec{\Pi}}} \right) - s^{\prime} > 0\), and hence \({\mathbf{WP}}_{1}^{{\prime \prime }} {\mathbf{Q}}_{k} \ne 0\).

As such, Tk is influenced by W, and the redundant variables X2 introduce extra noise W into the principal components (PCs). This finishes the proof. Based on the Theorem, one finds that PCA is not good at handling process data with redundant variables.

As for the contribution plot method, according to Eqs. (6) and (7), it is based on the difference between x and \({\hat{\mathbf{x}}}\). As shown in Fig. 2, when a fault occurs in a specific variable \({\mathbf{x}}_{j}\), (a) according to equation \({\mathbf{T}}{ = }{\mathbf{xP}}\), the relevant principal components are faulty; (b) according to equation \({\hat{\mathbf{x}}}{ = }{\mathbf{TP}}^{T}\), most reconstructed variables are faulty. As such, in a practical engineering application, it is hard to locate the fault source with the contribution plot method because the contribution indices of too many variables signal the fault.

Figure 2

Fault propagation from original data to reconstructed data.

Section summary

In sum, to eliminate the noise disturbance in the redundant variables, and to improve the fault localization ability, we develop a new monitoring algorithm based on the minimalist module and propose a corresponding fault localization strategy in “Minimalist module analysis (MMA)” section.

Minimalist module analysis (MMA)

The content of this section is listed in Fig. 3 below.

Figure 3

Content of this “Minimalist module analysis (MMA)” section.

Minimalist module division

Traditional PCA approaches focus on the k largest eigenvalues in matrix \({{\varvec{\Lambda}}}\), and the important information contained in the residual part is not used. When the residual eigenvalues are very small (e.g., below 0.05), one obtains \(\lambda_{j} \approx 0\left( {j = k + 1,k + 2, \ldots ,s} \right)\). Taking Pr as the columns of P0 associated with the s − k smallest eigenvalues, one obtains

$${\mathbf{XP}}_{r} \approx {\mathbf{0}}.$$
(16)

We assume \({\mathbf{X}} = \left[ {\begin{array}{*{20}c} {x_{1} } & {x_{2} } & {x_{3} } \\ \end{array} } \right]\), and \({\mathbf{P}}_{r} { = }\left[ {\begin{array}{*{20}c} {{\mathbf{P}}_{1,1} } & {{\mathbf{P}}_{1,2} } \\ {{\mathbf{P}}_{2,1} } & {{\mathbf{P}}_{2,2} } \\ {{\mathbf{P}}_{3,1} } & {{\mathbf{P}}_{3,2} } \\ \end{array} } \right]\). Then,

$$\left\{ \begin{gathered} x_{1} {\mathbf{P}}_{1,1} { + }x_{2} {\mathbf{P}}_{2,1} { + }x_{3} {\mathbf{P}}_{3,1} \approx 0 \hfill \\ x_{1} {\mathbf{P}}_{1,2} { + }x_{2} {\mathbf{P}}_{2,2} { + }x_{3} {\mathbf{P}}_{3,2} \approx 0 \hfill \\ \end{gathered} \right..$$
(17)

Through the transformation of Eq. (17), one obtains

$$\begin{aligned} & \left( {x_{1} {\mathbf{P}}_{1,1} { + }x_{2} {\mathbf{P}}_{2,1} { + }x_{3} {\mathbf{P}}_{3,1} } \right){\mathbf{P}}_{1,2} { - }\left( {x_{1} {\mathbf{P}}_{1,2} { + }x_{2} {\mathbf{P}}_{2,2} { + }x_{3} {\mathbf{P}}_{3,2} } \right){\mathbf{P}}_{1,1} \\ & { = }x_{2} \left( {{\mathbf{P}}_{2,1} {\mathbf{P}}_{1,2} { - }{\mathbf{P}}_{2,2} {\mathbf{P}}_{1,1} } \right){ + }x_{3} \left( {{\mathbf{P}}_{3,1} {\mathbf{P}}_{1,2} { - }{\mathbf{P}}_{3,2} {\mathbf{P}}_{1,1} } \right) \approx 0. \\ \end{aligned}$$
(18)

As such, one then obtains

$$\left\{ \begin{gathered} {\tilde{\mathbf{P}}}_{r} = \left[ {\begin{array}{*{20}c} 0 \\ {{\mathbf{P}}_{2,1} {\mathbf{P}}_{1,2} { - }{\mathbf{P}}_{2,2} {\mathbf{P}}_{1,1} } \\ {{\mathbf{P}}_{3,1} {\mathbf{P}}_{1,2} { - }{\mathbf{P}}_{3,2} {\mathbf{P}}_{1,1} } \\ \end{array} } \right] = {\mathbf{P}}_{r} \left[ {\begin{array}{*{20}c} {{\mathbf{P}}_{1,2} } \\ {{ - }{\mathbf{P}}_{1,1} } \\ \end{array} } \right] \hfill \\ {\mathbf{X}}\tilde{\mathbf{P}}_{r} \approx {\mathbf{0}} \hfill \\ \end{gathered} \right.$$
(19)

Unlike \({\mathbf{P}}_{r}\), some elements of \({\tilde{\mathbf{P}}}_{r}\) are 0, and hence Eq. (19) can describe the relationship between x2 and x3 without considering x1. In Eq. (19), variable set \(\left[ {\begin{array}{*{20}l} {x_{2} } \hfill & {x_{3} } \hfill \\ \end{array} } \right]\) is a minimalist module.

The flow of minimalist module division is as follows:

(a) Find a transformation matrix \({{\varvec{\Gamma}}} \in {\mathbf{R}}^{{\left( {s{ - }k} \right) \times \left( {s - k} \right)}}\) that maximizes the number of 0 elements in \({\tilde{\mathbf{P}}}_{r} = {\mathbf{P}}_{r} {{\varvec{\Gamma}}}\). This paper addresses this optimization problem by using the particle swarm optimization (PSO)30 algorithm as described below.

Step 1 Set num = 1.

Step 2 Take the \(num^{th}\) column of Pr as \({{\varvec{\Psi}}}_{{\mathbf{1}}}\) and the remaining \(s - k - 1\) columns as \({{\varvec{\Psi}}}_{{\mathbf{2}}}\). Solve the following optimization function by PSO:

$$\mathop {Minimize}\limits_{{{{\varvec{\Gamma}}}_{num} }} \left( {\left\| {{{\varvec{\Psi}}}_{1} - {{\varvec{\Psi}}}_{2} {{\varvec{\Gamma}}}_{num} } \right\|_{2} { - }\left\| {{{\varvec{\Psi}}}_{1} - {{\varvec{\Psi}}}_{2} {{\varvec{\Gamma}}}_{num} } \right\|_{\beta } } \right),$$
(20)

where \(\left\| {{{\varvec{\Psi}}}_{1} - {{\varvec{\Psi}}}_{2} {{\varvec{\Gamma}}}_{num} } \right\|_{\beta }\) denotes the number of elements of \({{\varvec{\Psi}}}_{1} - {{\varvec{\Psi}}}_{2} {{\varvec{\Gamma}}}_{num}\) that fall in the interval \(\left[ { - \beta ,\beta } \right]\) (\(\beta\) is close to 0, such as 0.01).

Step 3 If \(num = s - k\), go to step 4; else, num = num + 1 and go to step 2.

Step 4 \({\mathbf{\Gamma = {\rm I} - }}\left[ {\begin{array}{*{20}l} {{{\varvec{\Gamma}}}_{1} } \hfill & {{{\varvec{\Gamma}}}_{2} } \hfill & \ldots \hfill & {{{\varvec{\Gamma}}}_{s - k} } \hfill \\ \end{array} } \right]\).

(b) Calculate \({\tilde{\mathbf{P}}}_{r} { = }{\mathbf{P}}_{r} {{\varvec{\Gamma}}}\), adjust each column of \({\tilde{\mathbf{P}}}_{r}\) to unit variance, and set all elements in the interval \(\left[ { - \beta ,\beta } \right]\) to 0.

(c) Take the variables corresponding to the non-zero elements in the ith (\(i = 1,2, \ldots ,s - k\)) column of \({\tilde{\mathbf{P}}}_{r}\) as the ith minimalist module (MMi). (A Python sketch of this division procedure is given after the remark below.)

Remark

The form of the minimalist module is not unique; e.g., through the transformation of Eq. (17), one also obtains

$$\begin{aligned} & \left( {x_{1} {\mathbf{P}}_{1,1} { + }x_{2} {\mathbf{P}}_{2,1} { + }x_{3} {\mathbf{P}}_{3,1} } \right){\mathbf{P}}_{3,2} { - }\left( {x_{1} {\mathbf{P}}_{1,2} { + }x_{2} {\mathbf{P}}_{2,2} { + }x_{3} {\mathbf{P}}_{3,2} } \right){\mathbf{P}}_{3,1} \\ & { = }x_{1} \left( {{\mathbf{P}}_{1,1} {\mathbf{P}}_{3,2} { - }{\mathbf{P}}_{1,2} {\mathbf{P}}_{3,1} } \right){ + }x_{2} \left( {{\mathbf{P}}_{2,1} {\mathbf{P}}_{3,2} { - }{\mathbf{P}}_{2,2} {\mathbf{P}}_{3,1} } \right) \approx 0, \\ \end{aligned}$$

and hence variable set \(\left[ {\begin{array}{*{20}c} {x_{1} } & {x_{2} } \\ \end{array} } \right]\) is also a minimalist module. As such, the result of PSO may be different each time.
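A compact Python sketch of steps (a)–(c) is given below. The PSO routine is a generic minimal implementation written for this sketch; its particle count, inertia, and acceleration constants are illustrative assumptions rather than values from the paper, and, as the remark above notes, repeated runs may return different (but equally valid) modules. The eigenvalue threshold eig_tol plays the role of the small residual-eigenvalue bound mentioned above.

```python
import numpy as np

def pso_minimize(f, dim, n_particles=30, iters=200, lo=-2.0, hi=2.0,
                 w=0.7, c1=1.5, c2=1.5, seed=None):
    """Minimal particle swarm optimizer used to minimize the objective of Eq. (20)."""
    rng = np.random.default_rng(seed)
    x = rng.uniform(lo, hi, (n_particles, dim))
    v = np.zeros_like(x)
    pbest, pbest_val = x.copy(), np.array([f(p) for p in x])
    g = pbest[pbest_val.argmin()].copy()
    for _ in range(iters):
        r1, r2 = rng.random((2, n_particles, dim))
        v = w * v + c1 * r1 * (pbest - x) + c2 * r2 * (g - x)
        x = x + v
        vals = np.array([f(p) for p in x])
        better = vals < pbest_val
        pbest[better], pbest_val[better] = x[better], vals[better]
        g = pbest[pbest_val.argmin()].copy()
    return g

def minimalist_module_division(X, eig_tol=0.05, beta=0.01):
    """Steps (a)-(c): build P_tilde_r and read the minimalist modules from its columns."""
    n, s = X.shape
    eigval, eigvec = np.linalg.eigh(X.T @ X / (n - 1))     # ascending eigenvalues
    P_r = eigvec[:, eigval < eig_tol]                      # residual loadings, s x (s - k)
    m = P_r.shape[1]
    P_tilde = np.zeros_like(P_r)
    for j in range(m):                                     # one PSO run per column (Steps 1-3)
        psi1, psi2 = P_r[:, j], np.delete(P_r, j, axis=1)
        def objective(gamma):                              # Eq. (20): 2-norm minus near-zero count
            res = psi1 - psi2 @ gamma
            return np.linalg.norm(res) - np.sum(np.abs(res) <= beta)
        gamma = pso_minimize(objective, dim=m - 1, seed=j)
        col = psi1 - psi2 @ gamma
        col = col / np.linalg.norm(col)                    # rescale the column (step (b))
        col[np.abs(col) <= beta] = 0.0
        P_tilde[:, j] = col
    modules = [np.flatnonzero(P_tilde[:, j]).tolist() for j in range(m)]
    covered = {v for mod in modules for v in mod}
    independent = sorted(set(range(s)) - covered)          # variables outside every module
    return modules, independent, P_tilde
```

Applied to the nine-variable example of the "Simulation study of MMA" section, such a sketch should recover modules of the same kind as the four listed there, although the exact columns can differ between runs.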

Independent module

Each variable in a minimalist module is strongly correlated with the other variables in that module. Some variables, such as x8 and x9 in Fig. 3, are not strongly correlated with any other variables and are therefore not included in any minimalist module; these variables form the independent module.

Monitoring indices construction

Each minimalist module can be monitored by the PCA algorithm independently. We assume that \({\tilde{\mathbf{X}}}_{i} \in {\mathbf{R}}^{{n \times \tilde{s}}}\) are the data belonging to MMi. Then, \(rank\left( {{\tilde{\mathbf{X}}}_{i} } \right) = \tilde{s} - 1\) because each minimalist module represents exactly one independent correlation, and hence the number of PCs for each minimalist module is fixed as \(\tilde{s} - 1\). The monitoring indices of each module are calculated as

$$T_{{M_{i} }}^{2} = \frac{{T_{i}^{2} }}{{\delta_{{T_{i}^{2} }} }},$$
(21)

and

$$SPE_{{M_{i} }} = \frac{{SPE_{i} /T_{{M_{i} }}^{2} }}{{\delta_{{SPE_{i} }} }},$$
(22)

where \(T_{i}^{2}\) and \(SPE_{i}\) are the \(T^{2}\) and SPE indices for MMi, and \(\delta_{{T_{i}^{2} }}\) and \(\delta_{{SPE_{i} }}\) are the corresponding thresholds. Different from the traditional SPE index, \(SPE_{i}\) is divided by \(T_{{M_{i} }}^{2}\) to eliminate the impact of \(T_{{M_{i} }}^{2}\) on \(SPE_{i}\).

The indices for the whole process are

$$T_{M}^{2} { = }\sum\limits_{i = 1}^{s - k} {\left( {1 + \gamma * sign\left( {T_{{M_{i} }}^{2} - 1} \right)} \right)T_{{M_{i} }}^{2} } ,$$
(23)

and

$$SPE_{M} { = }\sum\limits_{i = 1}^{s - k} {\left( {1 + \gamma * sign\left( {SPE_{{M_{i} }} - 1} \right)} \right)SPE_{{M_{i} }} } ,$$
(24)

where \(\gamma\) is a positive value (e.g., \(\sqrt {s - k}\)). As such, when any minimalist module detects a fault, these two indices become much larger than their normal values. The threshold for both indices is \(s - k\).

As for the variables in the independent module, they can be monitored by the \(T^{2}\) index, which is denoted as \(T_{I}^{2}\).
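The following sketch assembles the indices of Eqs. (21)–(24) for one new sample. The per-module thresholds \(\delta_{T_{i}^{2}}\) and \(\delta_{SPE_{i}}\) are assumed here to be empirical 99th-percentile limits estimated from the training indices (the paper does not prescribe this particular estimator), and the dictionary keys are illustrative names.

```python
import numpy as np

def module_indices(x, module_models, gamma=None):
    """Per-module indices of Eqs. (21)-(22) and plant-wide indices of Eqs. (23)-(24).

    module_models: list of dicts, one per minimalist module, with keys
      'vars' (variable indices), 'P' and 'lam' (per-module PCA with s_tilde - 1 PCs),
      'd_t2' and 'd_spe' (thresholds, e.g. 99th percentiles on training data).
    """
    m = len(module_models)                       # m = s - k minimalist modules
    gamma = np.sqrt(m) if gamma is None else gamma
    t2_m, spe_m = np.zeros(m), np.zeros(m)
    for i, mdl in enumerate(module_models):
        xi = x[mdl['vars']]
        t = xi @ mdl['P']
        t2 = np.sum(t ** 2 / mdl['lam'])                   # T_i^2
        spe = np.sum((xi - t @ mdl['P'].T) ** 2)           # SPE_i
        t2_m[i] = t2 / mdl['d_t2']                         # Eq. (21)
        spe_m[i] = (spe / t2_m[i]) / mdl['d_spe']          # Eq. (22)
    T2_M = np.sum((1 + gamma * np.sign(t2_m - 1)) * t2_m)      # Eq. (23)
    SPE_M = np.sum((1 + gamma * np.sign(spe_m - 1)) * spe_m)   # Eq. (24)
    return t2_m, spe_m, T2_M, SPE_M              # alarm if T2_M or SPE_M exceeds s - k
```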

Fault localization

For MMA, the fault localization rules are different for \(T_{M}^{2}\), \(SPE_{M}\), and \(T_{I}^{2}\) indices.

(a) For the \(T_{M}^{2}\) index, when \(T_{{M_{i} }}^{2}\) is normal, all related variables are normal. For example, in the mathematical model in Fig. 3, when \(T_{{M_{1} }}^{2}\) and \(T_{{M_{2} }}^{2}\) are faulty, and \(T_{{M_{3} }}^{2}\) and \(T_{{M_{4} }}^{2}\) are normal, one gets that: (i) variables related to MM1 and MM2, i.e., \(x_{1}\), \(x_{2}\), \(x_{3}\), \(x_{4}\), and \(x_{5}\), may be faulty; (ii) all variables related to MM3 and MM4, i.e., \(x_{1}\), \(x_{4}\), \(x_{5}\), \(x_{6}\), and \(x_{7}\), are normal; (iii) \(x_{3}\) must be faulty because it is the only common variable shared by MM1 and MM2, and \(x_{2}\) may also be faulty because we have no more information for judging it. (A code sketch of this rule is given after this list.)

(b) For the \(SPE_{M}\) index, when \(SPE_{{M_{i} }}\) is faulty, the correlation between the variables in MMi may be faulty. For example, in the mathematical model in Fig. 3, one obtains \(SPE_{{M_{1} }} { = }\left( {x_{1} { + }x_{2} - x_{3} } \right)^{2} \approx 0\); when the correlation between \(x_{1}\), \(x_{2}\), and \(x_{3}\) changes to \(x_{1} - x_{2} = x_{3}\) or \(x_{1} + 2 * x_{2} = x_{3}\), then \(SPE_{{M_{1} }} = \left( {x_{1} + x_{2} - x_{3} } \right)^{2} \ne 0\) and \(SPE_{{M_{1} }}\) alarms the fault.

(c) When a fault occurs in variables that do not belong to any minimalist module, such as \(x_{8}\) and \(x_{9}\), it can only be handled with the detection result of the independent module, i.e., the contribution \(Con T_{j}^{2}\).
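The elimination logic of rule (a) can be written in a few lines. The sketch below is a literal translation of that rule; the module lists and alarm flags are assumed to come from the division and monitoring steps sketched earlier.

```python
def localize_from_t2(modules, t2_alarm):
    """Rule (a): clear every variable of a non-alarming module, then report the
    remaining variables of the alarming modules as fault candidates."""
    cleared = {v for mod, alarm in zip(modules, t2_alarm) if not alarm for v in mod}
    suspect = {v for mod, alarm in zip(modules, t2_alarm) if alarm for v in mod}
    return sorted(suspect - cleared)

# Example from Fig. 3 (0-indexed): MM1..MM4 = {x1,x2,x3}, {x3,x4,x5}, {x1,x5,x6}, {x4,x7},
# with MM1 and MM2 alarming; the candidates reduce to x2 and x3.
print(localize_from_t2([[0, 1, 2], [2, 3, 4], [0, 4, 5], [3, 6]],
                       [True, True, False, False]))        # -> [1, 2]
```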

Simulation study of MMA

This section studies the performance of MMA through simulation tests and compares it with PCA and mutual information–multiblock PCA (MI-MBPCA)31. MI-MBPCA employs mutual information to divide the blocks automatically, and hence it does not need process prior knowledge for block division. The test model is shown below:

$$\left\{ {\begin{array}{*{20}l} {x_{1} = N_{1} + 0.01 \times \omega_{1} } \hfill \\ {x_{2} = N_{2} + 0.01 \times \omega_{2} } \hfill \\ {x_{3} = x_{1} + x_{2} + 0.01 \times \omega_{3} } \hfill \\ {x_{4} = N_{3} + 0.01 \times \omega_{4} } \hfill \\ {x_{5} = x_{3} + x_{4} + 0.01 \times \omega_{5} } \hfill \\ {x_{6} = x_{5} + x_{1} + 0.01 \times \omega_{6} } \hfill \\ {x_{7} = x_{4} + 0.01 \times \omega_{7} } \hfill \\ {x_{8} = N_{4} + 0.01 \times \omega_{8} } \hfill \\ {x_{9} = N_{5} + 0.01 \times \omega_{9} } \hfill \\ \end{array} } \right..$$

Random variables Ni and \(\omega_{i}\) follow the standard Gaussian distribution, and \(\omega_{i}\) indicates the process noise. Approximately 10,000 normal observations are produced for offline modeling.
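A Python sketch of this data-generating model is given below (it uses the expression \(x_{5} = x_{3} + x_{4} + 0.01\omega_{5}\), consistent with fault 4 below; the seed and function name are illustrative).

```python
import numpy as np

def generate_normal_data(n=10000, seed=0):
    """Generate n normal samples from the nine-variable test model."""
    rng = np.random.default_rng(seed)
    N = rng.standard_normal((n, 5))
    w = 0.01 * rng.standard_normal((n, 9))
    X = np.zeros((n, 9))
    X[:, 0] = N[:, 0] + w[:, 0]                  # x1
    X[:, 1] = N[:, 1] + w[:, 1]                  # x2
    X[:, 2] = X[:, 0] + X[:, 1] + w[:, 2]        # x3 = x1 + x2
    X[:, 3] = N[:, 2] + w[:, 3]                  # x4
    X[:, 4] = X[:, 2] + X[:, 3] + w[:, 4]        # x5 = x3 + x4
    X[:, 5] = X[:, 4] + X[:, 0] + w[:, 5]        # x6 = x5 + x1
    X[:, 6] = X[:, 3] + w[:, 6]                  # x7 = x4
    X[:, 7] = N[:, 3] + w[:, 7]                  # x8
    X[:, 8] = N[:, 4] + w[:, 8]                  # x9
    return X
```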

After data normalization, the training data are adjusted to zero-mean and unit-variance. Then the normalized data are processed by MMA. The matrix \({\tilde{\mathbf{P}}}_{r}\) is obtained as follows:

\({\tilde{\mathbf{P}}}_{r} { = }\left[ {\begin{array}{*{20}r} \hfill 0 & \hfill {0.53} & \hfill 0 & \hfill {0.39} \\ \hfill 0 & \hfill {0.50} & \hfill 0 & \hfill 0 \\ \hfill 0 & \hfill { - 0.68} & \hfill {0.60} & \hfill 0 \\ \hfill {0.69} & \hfill 0 & \hfill {0.39} & \hfill 0 \\ \hfill 0 & \hfill 0 & \hfill { - 0.70} & \hfill {0.45} \\ \hfill 0 & \hfill 0 & \hfill 0 & \hfill { - 0.80} \\ \hfill { - 0.72} & \hfill 0 & \hfill 0 & \hfill 0 \\ \hfill 0 & \hfill 0 & \hfill 0 & \hfill 0 \\ \hfill 0 & \hfill 0 & \hfill 0 & \hfill 0 \\ \end{array} } \right]\).

Thus, MMA successfully obtains four minimalist modules: \(\left\{ {x_{1} } \quad {x_{2} } \quad {x_{3} } \right\}\), \(\left\{ {x_{3} } \quad {x_{4} } \quad {x_{5} } \right\}\), \(\left\{ {x_{1} } \quad {x_{5} } \quad{x_{6} } \right\}\), and \(\left\{ {x_{4} } \quad {x_{7} } \right\}\). Then, the independent module is \(\left\{ {x_{8} } \quad {x_{9} } \right\}\).

MI-MBPCA divides the process variables into the following five blocks: \(\left\{ {x_{1} } \right\}\), \(\left\{ {x_{2}} \quad {x_{3}} \quad {x_{5}} \quad{x_{6}} \right\}\), \(\left\{ {x_{4} } \quad {x_{7} } \right\}\), \(\left\{ {x_{8} } \right\}\), and \(\left\{ {x_{9} } \right\}\), which is not consistent with the process model because \(x_{1}\) is correlated with both \(x_{3}\) and \(x_{6}\), yet they do not belong to the same block.

To compare the monitoring performance of MMA, PCA, and MI-MBPCA, five test data sets are generated. Each data set contains 960 samples, and the fault occurs at the 160th sample point. The five faults are as follows:

Fault 1: a step change with amplitude of 5 in x1;

Fault 2: term N2 in the expression of x2 changes to \(3 * N_{2}\);

Fault 3: a step change with amplitude of 0.2 in x3;

Fault 4: term \(x_{3} + x_{4}\) in the expression of \(x_{5}\) changes to \(x_{3} + 2 * x_{4}\);

Fault 5: a step change with amplitude of 5 in \(x_{8}\).

The detection results are listed in Table 1. The false alarm rate is calculated as \(\frac{\text{the number of alarms among the first 160 samples}}{160}\), and the detection rate is calculated as \(\frac{\text{the number of alarms between the 161st and 960th samples}}{800}\). In this study, all control limits are based on a probability of 99%, and the best result is marked in bold.
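Both rates can be computed from a boolean alarm sequence in the obvious way; the sketch below simply encodes the two fractions above (the alarm sequence itself comes from comparing a method's monitoring indices with their control limits).

```python
import numpy as np

def alarm_rates(alarm, fault_start=160):
    """alarm: boolean array of length 960; the fault starts at index fault_start."""
    false_alarm_rate = 100.0 * np.mean(alarm[:fault_start])   # first 160 samples
    detection_rate = 100.0 * np.mean(alarm[fault_start:])     # samples 161-960
    return false_alarm_rate, detection_rate
```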

Table 1 False alarm rates (%) and detection rates (%) of the principal component analysis (PCA) method, the mutual information–multiblock PCA (MI-MBPCA), and the minimalist module analysis (MMA) method.

As shown in Table 1, the performance of MMA is better than that of PCA and MI-MBPCA for all five faults. Because MMA divides the whole process data into several minimalist modules and an independent module, and the noise in each variable does not disturb the unrelated modules, MMA is more robust to process noise than PCA. For MI-MBPCA, because each variable belongs to only one block and the remaining blocks may lose key information, the block models may be biased. One interesting finding in Table 1 is that MMA can successfully detect faults 3 and 4 while PCA fails. The reason for this phenomenon is that PCA monitors the complex correlations between all variables together, while MMA monitors each strong correlation (one minimalist module) independently; therefore, MMA is very sensitive to changes in specific correlations.

The fault localization results of the two algorithms for faults 3 and 5 are shown in Figs. 4 and 5, respectively. In Fig. 4, for PCA, \(Con SPE_{3}\), \(Con SPE_{5}\), and \(Con SPE_{6}\) alarm the fault, and we cannot locate the fault source. For MI-MBPCA, because \(x_{6}\) is influenced by \(x_{5}\), both variables alarm the fault and we cannot locate the fault source. For MMA, all \(ConT_{i}^{2}\) and \(T_{{M_{i} }}^{2}\) indices are normal, which means that all variables in the independent module are normal and all variables in the minimalist modules fluctuate within the normal range; because \(SPE_{{M_{2} }}\) signals a fault alarm, one finds that the correlations between \(x_{1}\), \(x_{2}\), and \(x_{3}\) are changed.

Figure 4

Fault localization for fault 3.

Figure 5

Fault localization for fault 5.

In Fig. 5, although a fault occurs in \(x_{8}\), most \(Con SPE_{i}\) indices in PCA signal a fault alarm, and we cannot locate the fault source. For MI-MBPCA, its contribution plot can in principle point to the fault source; however, because MI-MBPCA fails to detect fault 5, the fault localization step is skipped, and hence MI-MBPCA also fails to locate the fault source. For MMA, all \(T_{{M_{i} }}^{2}\) and \(SPE_{{M_{i} }}\) are normal, and hence one finds that the fault is not in the minimalist modules; only \(Con T_{8}^{2}\) signals a fault alarm, and hence MMA successfully locates the faulty variable \(x_{8}\).

Fault detection in the Tennessee Eastman process

The Tennessee Eastman (TE) process32 simulation is the most widely used simulation model for testing MSPM methods, and it is outlined in Fig. 6. The TE process uses 12 manipulated variables, 22 continuous process measurements, and 19 composition measurements sampled less frequently to simulate a classical chemical process. Because the 19 composition measurements are difficult to measure in real time and one manipulated variable, i.e., the agitation speed, is not manipulated, this study only monitors the other 22 measurements and 11 manipulated variables, as listed in Table 2. The 21 programmed faults introduced in the TE process are listed in Table 3. In this study, 960 normal samples are adopted as training data to construct the monitoring models. Each testing data set contains 960 samples, and the fault occurs at the 161st sample.

Figure 6

Schematic of the Tennessee Eastman process33.

Table 2 Monitored variables in the Tennessee Eastman process33.
Table 3 Descriptions of faults in the Tennessee Eastman process33.

In this section, we compare MMA with PCA, MI-MBPCA, deep principal component analysis (DePCA)34, and kernel dynamic PCA (KDPCA)35; the latter two methods are improved versions of PCA. The detection results of the five methods are listed in Table 4. The false alarm rate is calculated as \(\frac{\text{the number of alarms among the first 160 samples}}{160}\), and the detection rate is calculated as \(\frac{\text{the number of alarms between the 161st and 960th samples}}{800}\). In this study, all control limits are based on a probability of 99%, and the best result is marked in bold.

Table 4 False alarm rates (%) and detection rates (%) of the five fault detection methods.

As shown in Table 4, we find that MMA, MI-MBPCA, and PCA achieve similar false alarm rates, which are much lower than those of the two improved PCA methods (whose false alarm rates exceed 10%). For fault detection rates, MMA achieves the best results in 17 of the 21 faults; as for the remaining 4 faults, MMA's detection rates are not as high as those of DePCA only because DePCA sacrifices its false alarm rate. An eye-catching result is obtained in the case of fault 5: the detection rates of the compared methods are generally below 50%, whereas MMA achieves a 100.0% detection rate, which indicates the superiority of MMA. In addition, the performance of MMA in faults 10, 16, 19, and 20 is much better than that of the other four methods.

As the papers that proposed DePCA and KDPCA did not describe the construction of their contribution plots, we only compare the fault localization ability among PCA, MI-MBPCA, and MMA. The matrix \({\tilde{\mathbf{P}}}_{r}\) of MMA is shown in Table 5.

Table 5 Matrix \({\tilde{\mathbf{P}}}_{r}\) for the Tennessee Eastman process.

Figure 7 shows the fault localization results of fault 4. According to Table 3, fault 4 is a step change in the inlet temperature of the reactor cooling water. As depicted in Fig. 6, the reactor temperature (variable 9 in Table 2) changes, and hence the reactor cooling water flow (variable 32 in Table 2) also changes to compensate for the temperature change. For PCA, \(Con SPE_{9}\), \(Con SPE_{32}\), and \(Con T_{32}^{2}\) signal a fault alarm; for MI-MBPCA, about 14 variables alarm the fault and it fails in locating the fault source; for MMA, \(T_{{M_{1} }}^{2}\), \(T_{{M_{6} }}^{2}\), \(T_{{M_{8} }}^{2}\), and \(T_{{M_{13} }}^{2}\) signal a fault alarm, and, based on the fault localization rules presented in “Fault localization” section, one finds that variables 9 and 32 are faulty. Both PCA and MMA can locate this fault. Different from the contribution plot method of PCA, all \(SPE_{{M_{i} }}\) of MMA are normal, which tells the engineers that the correlations between variables have not changed, and hence the fault source is a change in the amplitude of some variables. Thus, it can be seen that, compared with PCA, MMA can provide more useful information for fault localization.

Figure 7

Fault localization for fault 4.

Conclusions

In this study, a new MSPM method called MMA was proposed to overcome the shortcoming of traditional MSPM methods in handling redundant correlations among process variables.

The superiority of MMA was verified by several simulation tests. It achieved much better detection performance for five different types of faults in a mathematical model test, two of which could not be detected by PCA and MI-MBPCA. MMA also performed better than other improved MSPM algorithms for 17 of the 21 faults in the Tennessee Eastman process.

MMA is a completely new method, and hence much work can be done based on it. First, we can combine it with traditional nonlinear, dynamic, and robust strategies to improve its fault detection ability. We can also combine it with the traditional contribution plot method to improve its fault localization ability. Moreover, we can combine it with the key performance indicator14 monitoring strategy. All of these investigations will be part of our future work.