Introduction

In recent years, artificial-intelligence technology has been widely used in production processes to reduce labor and quickly respond to process variations. In particular, machine learning and deep learning methods have been introduced into process control and excelled in quickly detecting and correctly identifying abnormal process conditions. For example, Lee and Kim1, Wang et al.2, Chen and Yu3, Kim and Ha4, Yeganeh et al.5, and Sabahno and Amiri6 proposed control charts based on machine or deep learning to improve detection ability. Zhang et al.7, Yu and Liu8, Maged et al.9, and Yu et al.10,11 achieved excellent performance in failure detection in high-dimensional or complex processes using deep learning methods. Moreover, Zan et al.12, Lu et al.13, Yu and Zhang14, Lee et al.15, and Xue et al.16 used machine and deep learning methods to recognize abnormal control chart patterns.

To test the lifetime of electronic products, practitioners may set a test termination time at which to halt the test in order to reduce time and cost. As some tested units may not have failed by the termination time, practitioners can only obtain incomplete lifetime records, which are known as type-I right-censored data. This is a challenge for practitioners because the detection efficiency of traditional Shewhart-type and EWMA-type control charts for process variation decreases for incomplete data. Thus, some researchers have proposed novel control charts to monitor right-censored data10,11,17,18. Steiner and Mackay19 estimated the mean lifetime of incomplete data with a conditional expected value (CEV) and combined it with an \(\overline{X}\) chart having a lower control limit (LCL) to detect decreases in lifetime for highly right-censored data under normality assumptions. Steiner and Mackay20 then used Shewhart-type CEV control charts for censored data under non-normality conditions, and Steiner and Mackay21 subsequently combined the EWMA and CEV to increase the detection ability for highly right-censored data. Lee22 determined the design parameters of the CEV \(\overline{X}\) control chart from a minimum-cost perspective.

Zhang and Chen23 assumed a Weibull distribution for the data and detected decreases and increases in the average lifetime for censored data using two single-sided EWMA CEV control charts. Tsai and Lin24, Raza et al.25, and Biozone and Wang26 also developed EWMA-type CEV control charts and achieved good monitoring performance for non-normal censored data. Raza et al.27 and Raza and Siddiqi28 established a double EWMA (DEWMA) control chart to detect shifts of the scale parameter for gamma censored data. Raza and Siddiqi28, Raza et al.29, and Ali et al.30 made similar contributions to the DEWMA CEV control chart. Ali et al.31 and Ahmed et al.32,33 developed novel control charts to monitor Weibull and generalized exponential (GE) censored data by replacing the CEV with the conditional median (CM) and conditional standard deviation (CSD). In addition, Lee et al.34 and Zhao and Wu35 implemented the EWMA CEV chart for multiple censored data and window-censored data, respectively. However, to the authors’ knowledge, there have been no relevant studies monitoring right-censored data using deep learning methods.

Convolutional neural networks (CNNs) and long short-term memory (LSTM) networks are common deep learning methods that have been successfully combined with control chart technology to monitor processes and recognize abnormal patterns in control charts7,14,36. Thus, these techniques may be used to improve the efficiency of a control chart for censored data. Most previous works developed CEV and CM charts for normal, GE, or Weibull censored data. Gamma data are also often observed in reliability life tests, and the gamma distribution is one of the commonly used lifetime distributions because changing its shape parameter creates distributions with different degrees of skewness. Therefore, this study combines CNN and LSTM networks with the EWMA CEV and CM charts to monitor gamma type-I right-censored data.

The remainder of this study is organized as follows. The “Methodology” section presents the methodology. The “Proposed control charts” section describes the proposed control charts, and the “Performance comparison” section investigates their performance using a statistical simulation. The “Real-world case study” section details the implementation of the proposed method in a case study. Finally, the “Conclusions” section provides some conclusions and directions for future development.

Methodology

EWMA charts for gamma type-I censored data

Let \(U = \left\{ {u_{1} ,u_{2} , \ldots ,u_{m} } \right\}\) be a set of gamma-distributed lifetimes, and let \(a_{0}\) and \(b_{0}\) be the shape and scale parameters of an in-control process, respectively. The censoring rate for the gamma lifetimes can be represented as \(Pc = 1 - F_{Ga} \left( {u = c_{T} {|}a_{0} ,b_{0} } \right)\), where \(F_{Ga} \left( { \cdot {|}a_{0} ,b_{0} } \right)\) is the cumulative distribution function (CDF) of the gamma distribution with parameters \(a_{0}\) and \(b_{0}\), and \(c_{T}\) is the censoring time. The CEV of the gamma distribution is:

$$Cev = E\left( {U{|}u \ge c_{T} } \right) = \frac{{a_{0} b_{0} \left[ {1 - F_{Ga} \left( {u = c_{T} {|}a_{0} + 1,b_{0} } \right)} \right]}}{{1 - F_{Ga} \left( {u = c_{T} {|}a_{0} ,b_{0} } \right)}},$$
(1)

and the CM of the gamma distribution is

$$CM = F_{Ga}^{ - 1} \left( {0.5 + 0.5F_{Ga} \left( {u = c_{T} {|}a_{0} ,b_{0} } \right){|}a_{0} ,b_{0} } \right),$$
(2)

where \(F_{Ga}^{ - 1} \left( { \cdot {|}a_{0} ,b_{0} } \right)\) is the inverse CDF of the gamma distribution with parameters \(a_{0}\) and \(b_{0}\). The derivations of Eqs. (1) and (2) are shown in the Supplementary Information. Practitioners take \(n\) samples and measure their lifetime values using the reliability life testing method. Let \(u_{i}\) be the lifetime of the i-th tested sample. The sample mean of size \(n\) is \(\overline{X} = \sum\nolimits_{i = 1}^{n} {x_{i} } /n\), where the \(x_{i}\) of the i-th tested sample is:

$$x_{i} = \left\{ {\begin{array}{*{20}l} {u_{i} } \hfill & {\quad for\;u_{i} \le c_{T} } \hfill \\ {Cd} \hfill & {\quad for\;u_{i} > c_{T} } \hfill \\ \end{array} } \right.,$$
(3)

where \(Cd = Cev\) for the CEV \(\overline{X}\) statistic or \(Cd = CM\) for the CM \(\overline{X}\) statistic. The in-control mean \(M_{0}\) and variance \(V_{0}\) can be expressed as follows:

$$\begin{aligned} & M_{0} = a_{0} b_{0} \;\;{\text{and}} \\ & V_{0} = a_{0} b_{0}^{2} . \\ \end{aligned}$$
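As a minimal illustration (our own Python/SciPy sketch rather than the authors' MATLAB implementation, with assumed values for \(a_0\), \(b_0\), \(Pc\), and \(n\)), Eqs. (1)–(3) can be evaluated as follows.

```python
import numpy as np
from scipy.stats import gamma

a0, b0, Pc, n = 2.0, 1.0, 0.5, 5       # assumed in-control shape/scale, censoring rate, sample size

# Censoring time implied by Pc = 1 - F_Ga(c_T | a0, b0)
c_T = gamma.ppf(1.0 - Pc, a0, scale=b0)

# Eq. (1): conditional expected value of U given u >= c_T
Cev = a0 * b0 * gamma.sf(c_T, a0 + 1.0, scale=b0) / gamma.sf(c_T, a0, scale=b0)

# Eq. (2): conditional median of U given u >= c_T
CM = gamma.ppf(0.5 + 0.5 * gamma.cdf(c_T, a0, scale=b0), a0, scale=b0)

def censored_sample_mean(u, Cd):
    """Eq. (3): replace censored lifetimes (u > c_T) with Cd (Cev or CM), then average."""
    x = np.where(u <= c_T, u, Cd)
    return x.mean()

u = gamma.rvs(a0, scale=b0, size=n, random_state=1)
print(c_T, Cev, CM, censored_sample_mean(u, Cev))
```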

Let \(\lambda\) be the smoothing parameter of the EWMA chart. Zhang and Chen23 showed that the EWMA statistic at period j for monitoring the mean decrease is:

$$E_{j} = min\left\{ {M_{0} ,\lambda \overline{{X_{j} }} + \left( {1 - \lambda } \right)E_{j - 1} } \right\}.$$
(4)

For an EWMA chart with CEV or CM used to monitor a process mean, an LCL is set to signal a mean reduction because practitioners focus on detecting decreases in the average lifetime. The appearance of an assignable cause leads to a decrease in the process mean, indicating an out-of-control condition. Let \(M_{1} = \delta \times M_{0}\) be the mean of the out-of-control state, where the process variance is unchanged and \(\delta = M_{1} /M_{0}\) is the mean shift size. The gamma shape parameter \(a_{1}\) and scale parameter \(b_{1}\) in the out-of-control state can be obtained by solving the following system of simultaneous equations:

$$\left\{ {\begin{array}{*{20}l} {M_{1} = a_{1} b_{1} } \hfill \\ {V_{0} = a_{1} b_{1}^{2} } \hfill \\ \end{array} } \right.,$$

The solutions for \(a_{1}\) and \(b_{1}\) are, respectively, as follows:

$$\begin{aligned} a_{1} & = M_{1}^{2} /V_{0} \;\;{\text{and}} \\ b_{1} & = V_{0} /M_{1} . \\ \end{aligned}$$
(5)
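The shifted parameters of Eq. (5) and the one-sided EWMA recursion of Eq. (4) can be sketched as follows (our own Python sketch with assumed values; the inputs \(\overline{X}_j\) are the censored sample means of Eq. (3)).

```python
import numpy as np

a0, b0, lam, delta = 2.0, 1.0, 0.1, 0.7    # assumed in-control parameters, smoothing constant, shift size
M0, V0 = a0 * b0, a0 * b0**2               # in-control mean and variance

# Eq. (5): out-of-control shape and scale with the variance held at V0
M1 = delta * M0
a1, b1 = M1**2 / V0, V0 / M1

def ewma_series(xbar, M0, lam):
    """Eq. (4): one-sided (lower) EWMA of the censored sample means."""
    E, prev = np.empty(len(xbar)), M0       # start the chart at the in-control mean
    for j, x in enumerate(xbar):
        prev = min(M0, lam * x + (1.0 - lam) * prev)
        E[j] = prev
    return E
```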

The primary metric employed for evaluating the effectiveness of control charts is the average run length (ARL)3,15,23,30,34. In an in-control process, a larger ARL signifies a reduced false-alarm rate, while in an out-of-control state, a smaller ARL indicates quicker detection of mean reduction.

CNN

The main advantage of CNNs is that they can effectively capture local features in the data and perform feature extraction for classification or regression prediction, while maintaining the spatial hierarchy of features. Some of the main features and working principles of CNN are laid out below.

The convolutional layer is a fundamental element of CNNs. It operates by applying convolutional kernels (also referred to as filters) over the input data or image through a sliding process, resulting in the generation of feature maps. This operation supports the identification of local features. The output of a convolutional layer \(l\) can be expressed by:

$$\zeta_{l}^{CL} = f_{RL} \left( {b_{l}^{CL} + \zeta_{l - 1}^{CL} \times \omega_{l - 1}^{CL} } \right),$$

where \(f_{RL} \left( \cdot \right)\) is a rectified linear unit (ReLU) activation function, \(l\) is the \(l\)-th layer of the CNN, \(\omega_{l - 1}^{CL}\) is the filter kernel at layer \(l - 1\), and \(b_{l}^{CL}\) is the bias vector at layer \(l\).

The pooling layer plays a crucial role in CNNs by decreasing the dimensionality of feature maps while retaining vital information. The most common pooling operation is max pooling, which selects the highest value within specific regions, thus reducing the feature map’s dimensions. The pooling output at layer \(l\) is given by:

$$Mp_{j} = max\left( {\zeta_{l}^{CL} } \right).$$

After the convolutional and pooling layers, CNNs frequently incorporate fully connected layers to carry out ultimate classification or regression tasks. These layers are responsible for transforming the extracted features into the network’s final output. CNNs commonly comprise a series of convolutional and pooling layers stacked in an interleaved fashion. This layered architecture empowers the network to acquire knowledge about image characteristics spanning diverse levels of abstraction7,8.
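As an informal illustration (a PyTorch sketch of our own; the study's actual layer settings are those listed in Table 2, and the window length here is an arbitrary choice), a small one-dimensional CNN regressor with a convolutional layer, ReLU activation, max pooling, and a fully connected output can be written as:

```python
import torch
import torch.nn as nn

class Cnn1dRegressor(nn.Module):
    def __init__(self, window=8, channels=16, kernel=3):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv1d(1, channels, kernel_size=kernel, padding=1),  # convolutional layer (filter kernels)
            nn.ReLU(),                                              # f_RL activation
            nn.MaxPool1d(kernel_size=2),                            # max pooling
        )
        self.head = nn.Linear(channels * (window // 2), 1)          # fully connected output layer

    def forward(self, x):                   # x: (batch, 1, window)
        z = self.features(x)
        return self.head(z.flatten(1))

model = Cnn1dRegressor()
print(model(torch.randn(4, 1, 8)).shape)    # torch.Size([4, 1])
```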

LSTM

An LSTM is a deep learning architecture that improves upon traditional recurrent neural networks (RNNs). It was specially designed to process sequence data and is used in applications such as speech recognition, natural language processing, and time-series analysis. The LSTM supports sequence data processing by introducing the three key gating mechanisms described below.

The forget gate determines whether to forget the previous memory information. It uses a sigmoid function to output a value between 0 and 1, controlling whether past memories are retained. The forget gate \({f}^{fg}\) can be expressed as:

$$f^{fg} = f_{sf} \left( {b^{fg} + \omega^{fg} \left[ {\zeta_{t - 1} ,\vartheta_{t} } \right]} \right),$$

where \(f_{sf} \left( \cdot \right)\) is a sigmoid function, \(b^{fg}\) is the bias vector of the forget gate, \(\omega^{fg}\) is the weight vector of the forget gate, \(\zeta_{t - 1}\) is the output vector of the previous step, and \(\vartheta_{t}\) is the input vector of the current step.

The input gate determines how new memory information is added to the LSTM unit. It uses a sigmoid function to determine which information needs updating and uses the tanh function to create a memory cell vector. Let \(f^{ig}\) be the formula of the input gate and \(\epsilon_{t}\) be the memory cell vector of the current step, as follows:

$$\begin{aligned} & f^{ig} = f_{sf} \left( {b^{ig} + \omega^{ig} \left[ {\zeta_{t - 1} ,\vartheta_{t} } \right]} \right)\;\;{\text{and}} \\ & \epsilon_{t} = f^{fg} \times \epsilon_{t - 1} + f^{ig} \times tanh\left( {b^{\epsilon} + \omega^{\epsilon} \left[ {\zeta_{t - 1} ,\vartheta_{t} } \right]} \right), \\ \end{aligned}$$

where \(b^{ig}\) is the bias vector of the input gate, \(\omega^{ig}\) is the weight vector of the input gate, \(tanh\left( \cdot \right)\) is the tanh function, and \(b^{\epsilon}\) and \(\omega^{\epsilon}\) are the bias and weight vectors of the memory cell vector for the current step, respectively.

The output gate determines which memory information will be outputted to the next time step. Similar to the forget gate and input gate, the output gate uses a sigmoid function to control the output. The output gate \(f^{og}\) is:

$$f^{og} = f_{sf} \left( {b^{og} + \omega^{og} \left[ {\zeta_{t - 1} ,\vartheta_{t} } \right]} \right),$$

where \(b^{og}\) is the bias vector of the output gate and \(\omega^{og}\) is the weight vector of the output gate. The output vector of the current step \(\zeta_{t}\) is:

$$\zeta_{t} = f^{og} \times tanh\left( {\epsilon_{t} } \right).$$

The dimension of \(\zeta_{t}\), i.e., the number of hidden units, affects the LSTM network’s computing efficiency and effectiveness. The LSTM uses these gates to control the flow of information and update its memory, which helps to address the vanishing-gradient problem of RNNs and allows the network to better handle long sequences9,36.
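To make the gate equations concrete, the following NumPy sketch (our own; the weights are random placeholders rather than trained values) computes a single LSTM cell step, where h corresponds to \(\zeta_t\) and c to \(\epsilon_t\):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x_t, h_prev, c_prev, W, b):
    """One step of the forget-, input-, and output-gate equations."""
    z = np.concatenate([h_prev, x_t])    # [zeta_{t-1}, vartheta_t]
    f = sigmoid(W["f"] @ z + b["f"])     # forget gate f^fg
    i = sigmoid(W["i"] @ z + b["i"])     # input gate f^ig
    g = np.tanh(W["g"] @ z + b["g"])     # candidate memory
    o = sigmoid(W["o"] @ z + b["o"])     # output gate f^og
    c = f * c_prev + i * g               # memory cell epsilon_t
    h = o * np.tanh(c)                   # output vector zeta_t
    return h, c

rng = np.random.default_rng(0)
hidden, inputs = 4, 1                    # number of hidden units and input dimension
W = {k: rng.normal(size=(hidden, hidden + inputs)) for k in "figo"}
b = {k: np.zeros(hidden) for k in "figo"}
h, c = lstm_step(np.array([0.3]), np.zeros(hidden), np.zeros(hidden), W, b)
```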

Proposed control charts

This section proposes a procedure to set up and implement a control chart based on deep learning networks with the EWMA CEV or CM statistic, as shown in Fig. 1. The procedure consists of two parts: setting up the control chart and implementing it.

Figure 1 Flowchart for the proposed control chart.

The purpose of setting up the control chart is to find the optimal threshold value that achieves a specified in-control ARL value, thereby maintaining the monitoring performance of the chart. Before setting up the chart, practitioners determine an in-control ARL value and choose an initial threshold value based on their own experience. In the literature, the in-control ARL value used for control chart performance comparisons is typically 200 or 370.4.

According to the quality characteristic and control parameters, such as \(a_{0}\), \(b_{0}\), \(\lambda\), \(Pc\), and \(n\), in-control gamma type-I censored data can be generated using the Monte Carlo method, and the in-control EWMA statistics \(E_{j}\) can then be calculated using Eq. (4). Note that \(E_{j}\) is the EWMA CEV statistic for \(Cd = Cev\) or the EWMA CM statistic for \(Cd = CM\). First, 10,000 \(E_{j}\) are generated. Next, the pairs \(\left\{ {E_{t - 1} ,E_{t} } \right\}\) form the training data set, which is inputted into the CNN or LSTM network for training, with \(E_{t - 1}\) as the input and \(E_{t}\) as the target output. Because the input layer of the CNN or LSTM network must use one-dimensional sequence data, the input and output vectors have dimension \(1 \times \left( {m - 1} \right)\), as illustrated below.
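The following sketch (our own; a Gaussian series stands in for the simulated in-control \(E_{j}\) values) shows how the one-step training pairs and their \(1 \times (m-1)\) shapes are constructed.

```python
import numpy as np

m = 10_000
rng = np.random.default_rng(0)
E = rng.normal(loc=2.0, scale=0.05, size=m)   # stand-in for the in-control EWMA statistics E_j

X = E[:-1].reshape(1, -1)    # input sequence  {E_1, ..., E_{m-1}}, shape (1, m-1)
Y = E[1:].reshape(1, -1)     # target sequence {E_2, ..., E_m},     shape (1, m-1)
```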

Practitioners are more concerned about decreases in the average lifetime, so the threshold value \(\eta\), which plays the role of the LCL in a traditional control chart, is set to detect such decreases. After the network is trained, the estimate \(\widehat{{E_{t} }}\) is outputted and the residual value is computed as \(E_{t} - \widehat{{E_{t} }}\). If the j-th residual value is less than the threshold value \(\eta\), the count of points outside the threshold value (OC) is increased by one. The above procedure is repeated 10,000 times to obtain the total number of points outside the threshold value, and the in-control ARL value is 10,000/OC (note that the false-alarm rate is OC/10,000).

If the simulated in-control ARL value is not equal to the specified in-control ARL value, the threshold value \(\eta\) is adjusted and the above simulation procedure is repeated until the simulated ARL value equals the specified value. In this way, the optimal threshold value \(\eta^{*}\) is obtained and then used to monitor the process.
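The adjustment loop can be sketched as a bisection search (our own simplification; the residuals \(E_t - \widehat{E_t}\) would come from the in-control Monte Carlo runs described above, and the search interval is an assumption).

```python
import numpy as np

def in_control_arl(eta, residuals):
    """ARL = number of residuals / number of points falling below the threshold."""
    oc = np.sum(residuals < eta)
    return np.inf if oc == 0 else residuals.size / oc

def calibrate_threshold(residuals, target_arl=200.0, lo=-1.0, hi=0.0, iters=60):
    # Lowering eta produces fewer signals and a larger in-control ARL, so bisection applies.
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if in_control_arl(mid, residuals) < target_arl:
            hi = mid        # too many false alarms: move the threshold lower
        else:
            lo = mid        # ARL already large enough: try a higher threshold
    return 0.5 * (lo + hi)  # approximate optimal threshold eta*
```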

In the implementation of the control chart for process monitoring, practitioners import a trained CNN or LSTM network, set up the optimal threshold value \(\eta^{*}\), and then apply the following steps:

  1. Collect lifetime data for period t − 1 and calculate the statistic \(E_{t - 1}\).
  2. Input \(E_{t - 1}\) into the trained network to predict the statistic \(\widehat{{E_{t} }}\) of period t.
  3. After the lifetime data for period t are obtained, compute the actual value \(E_{t}\).
  4. Let the error value for period t be \(E_{t} - \widehat{{E_{t} }}\).

If the error value is less than the optimal threshold value \(\eta^{*}\), the chart signals an out-of-control condition; otherwise, the process is considered in control.
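The four monitoring steps can be sketched as a simple loop (our own; a toy `predict` function stands in for the trained CNN/LSTM network, and the data stream is illustrative).

```python
import numpy as np

def monitor(E_stream, predict, eta_star):
    """Signal when the residual E_t - E_t_hat drops below the threshold eta*."""
    E_prev = E_stream[0]                     # step (1): statistic of period t - 1
    for t, E_t in enumerate(E_stream[1:], start=1):
        E_hat = predict(E_prev)              # step (2): predicted statistic for period t
        error = E_t - E_hat                  # steps (3)-(4): actual value and error value
        if error < eta_star:
            return t                         # out-of-control signal at period t
        E_prev = E_t
    return None                              # no signal

predict = lambda e_prev: e_prev              # toy stand-in for the trained network
E_stream = np.array([2.00, 1.98, 1.95, 1.60, 1.40])
print(monitor(E_stream, predict, eta_star=-0.186))   # signals at period 3
```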

Figure 2 shows the simulation process for an out-of-control ARL value. Practitioners can specify a shift size \(\delta\) and calculate the out-of-control parameters \(a_{1}\) and \(b_{1}\) shown in Fig. 2 using Eq. (5).

Figure 2 Flowchart for simulation of an ARL value using the proposed control chart.

Previous studies have typically considered two types of performance: zero-state (ZS) and steady-state (SS)37,38. ZS performance assumes that a shift occurs at the beginning of the process when measuring the out-of-control ARL value. SS performance measures the out-of-control ARL when the shift occurs after the control statistic has reached its stationary distribution.

For the simulated data generation under the ZS and SS conditions, assume that the process has run continuously for \(\pi\) sampling periods and remained in the in-control state, where \(\pi\) represents the number of periods required to reach the SS condition. The mean shift occurs between the \(\pi\)-th and (\(\pi\) + 1)-st samples, and the SS ARL is the expected number of samples from the occurrence of this mean shift until the chart indicates an out-of-control signal. In the data generation process, in-control data for \(\pi\) periods are generated first, followed by 10,000 out-of-control samples. The EWMA statistics are calculated for all \(\pi\) + 10,000 samples using Eq. (4), and the last 10,000 EWMA statistics are used to simulate the out-of-control ARL values under the SS condition according to Fig. 2. For the ZS condition, the simulated data are generated with \(\pi = 0\).
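One SS run length can be simulated roughly as follows (our own condensed sketch that monitors the EWMA statistic against an LCL rather than the trained network's residuals; all argument values are assumptions).

```python
import numpy as np

def run_length(a0, b0, a1, b1, lam, n, c_T, Cd, lcl, pi, max_oc=10_000, seed=None):
    """pi in-control periods, then a sustained mean shift; count samples until a signal."""
    rng = np.random.default_rng(seed)
    M0 = a0 * b0
    E = M0                                           # start the chart at the in-control mean
    for j in range(pi + max_oc):
        a, b = (a0, b0) if j < pi else (a1, b1)      # mean shift occurs after period pi
        u = rng.gamma(a, b, size=n)                  # lifetimes of the n tested units
        xbar = np.where(u <= c_T, u, Cd).mean()      # Eq. (3)
        E = min(M0, lam * xbar + (1.0 - lam) * E)    # Eq. (4)
        if j >= pi and E < lcl:
            return j - pi + 1                        # samples needed after the shift
    return max_oc

# The SS ARL (or ZS ARL with pi = 0) is the average of run_length(...) over many replications.
```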

MATLAB R2023a provides a deep learning toolbox that can easily implement the processes of Figs. 1 and 2. This study codes the simulation processes using MATLAB R2023a to investigate the performance of the proposed control charts.

Performance comparison

Hereafter, ‘CNN chart’ and ‘LSTM chart’ denote the control charts based on the CNN and LSTM networks, respectively. This section compares the ARL performance of the CNN, LSTM, and EWMA charts with CEV or CM for gamma type-I censored data. The gamma distribution parameters are set to \(a_{0}\) = 1, 2, and 4 with \(b_{0} = 1\) for comparison, and the corresponding skewness values (Sk) are shown in Table 1. Smaller \(a_{0}\) values indicate greater skewness of the gamma distribution.

Table 1 Skewness values of the gamma distribution.

When using the CNN network, some network parameters, such as the number of stacked convolutional and pooling layers, the kernel size, and the number of kernels, must be determined first to achieve good training and testing results. For the LSTM chart, the number of stacked LSTM layers and the number of hidden units must also be decided. Based on the literature3,39, this study uses the trial-and-error method to set these network parameters as listed in Table 2.

Table 2 Structure and parameters of deep learning networks for the proposed control charts.

In the performance comparison, the sample size is fixed at \(n = 5\), which is standard practice for sampling and plotting control charts. The smoothing parameter \(\lambda\) of the EWMA statistic is set at 0.1 and 0.2, following previous studies23,27,29. \(Pc\) is 0.2, 0.5, and 0.8 for lower, moderate, and higher censoring rates, respectively. The shift sizes are \(\delta = 0.8\) and 0.7 for small shifts, \(\delta = 0.6\) and 0.5 for moderate shifts, and \(\delta = 0.2\) for large shifts. This study sets the in-control ARL value at 200 to measure the out-of-control ARL values under the ZS and SS conditions.

Considering that different trained CNN or LSTM networks yield different ARL values, this study trained 100 networks for each combination of process parameters (\(a_{0}\), \(b_{0}\), \(Pc\), and \(\lambda\)) and then selected the trained network with the smallest ARL value from the 100 trained networks for comparison.

Table 3 shows the LCL and the optimal threshold value \(\eta^{*}\) for the six comparison charts under ZS condition (\(\pi\) = 0). The out-of-control ARL values for the six control charts were simulated according to the above conditions and the ARL values are compared in Table 4. The bold cells indicate the control chart with the best detection efficiency for a specific shift size.

Table 3 Design parameter values of six control charts for comparison under ZS condition.
Table 4 ZS ARL values of six control charts.

As the skewness coefficient decreases, the detection efficiency of the six control charts decreases. As \(Pc\) increases, the detection efficiency of the EWMA chart decreases, while that of the CNN and LSTM charts changes irregularly. The CNN chart exhibits the best detection ability for most shift sizes, whereas the LSTM chart is significantly worse than the other charts for all shift sizes. Comparing CEV and CM, the CNN chart with CEV is better than the CNN chart with CM when the skewness coefficient is small, and the CNN chart with CM becomes better as the skewness coefficient grows larger. The relative performance of the EWMA and LSTM charts is not affected by the skewness coefficient: the EWMA and LSTM charts with CEV perform better than those with CM.

For processes in which the mean shift tends to occur in the initial stage, if the skewness coefficient of the lifetime distribution is large, the EWMA CM statistic should be used to train the CNN network and implement monitoring; otherwise, the EWMA CEV statistic should be used.

This study considers \(\pi\) = 100 and 1000 to measure the ARL values under the SS condition. Table 5 shows the LCL and \(\eta^{*}\) of the six control charts for \(\pi\) = 1000, and Table 6 exhibits the corresponding SS ARL values.

Table 5 Design parameter values of six control charts for comparison under SS condition.

As shown in Table 6, the CNN chart outperforms the EWMA and LSTM charts for most shift sizes when \(\pi\) = 1000; only in some cases with \(\delta\) = 0.8 is the EWMA chart better than the CNN chart. In most cases with \(\delta\) ≤ 0.5, the LSTM chart has better detection efficiency than the EWMA chart but is still not as efficient as the CNN chart; for the other shift sizes, the LSTM chart is worse than both the EWMA and CNN charts. The detection efficiency of the EWMA, CNN, and LSTM charts decreases as \(Pc\) increases or the skewness coefficient of the lifetime distribution becomes large for most shift sizes. The EWMA, CNN, and LSTM charts with CEV have better detection efficiency than their CM counterparts in the cases with \(Pc=0.2\) for all \({a}_{0}\) values and with gamma parameters (\({a}_{0}\) = 4, \({b}_{0}\) = 1) for all \(Pc\) values.

Table 6 SS ARL values of six control charts with \(\pi = 1000\).

Comparing the ARL values in Tables 4 and 6 for ZS (\(\pi\) = 0) and \(\pi\) = 1000, the EWMA charts with CEV and CM exhibit worse detection efficiency under the ZS condition than under the SS condition. For the LSTM charts with CEV and CM, the detection efficiency increases as \(\pi\) increases for most shift sizes but changes irregularly for some small shift sizes. The CNN charts with CEV and CM perform better under the ZS condition than under the SS condition for most shift sizes. As \(\pi\) increases, the detection ability of the CNN charts with CEV and CM is slightly reduced for most shift sizes but significantly reduced in most cases with \(\delta\) = 0.8.

When the mean shift occurs after the process has been running for a long time, the CNN chart is the best choice for gamma type-I censored data unless tiny shift sizes must be detected. The CNN chart with CEV is suitable for gamma censored data with lower censoring rates or smaller skewness values, and the CNN chart with CM is recommended for monitoring moderately and highly censored gamma data with larger skewness values.

Real-world case study

A reliability life test for a liquid-crystal display module (LCM) was conducted at a temperature of 70 °C with 80% relative humidity. Based on historical data analysis, the lifetime distribution of an LCM is known to follow a gamma distribution with shape parameter \({a}_{0}=5.72\) and scale parameter \({b}_{0}=0.48\). To save testing time and cost, practitioners use a censoring rate of \(Pc=0.8\) to conduct the test, and the corresponding censoring time \({c}_{T}\) is 1.76 h. The skewness coefficient of this lifetime distribution is 0.35, which approximates a symmetrical distribution; therefore, the EWMA CEV statistic is selected for monitoring the LCM’s lifetime. The CEV of this lifetime distribution is calculated as 3.09. According to quality inspection regulations, five units of each batch of LCMs must be randomly sampled for lifetime testing. Because the CNN-based EWMA chart performs better than the traditional EWMA chart and the LSTM-based EWMA chart, practitioners developed a CNN-based EWMA chart with CEV using \(\lambda =0.1\) and an in-control ARL of 200. Following Fig. 1, practitioners trained a CNN network using the EWMA CEV statistics in Eq. (4) and obtained the optimal threshold value \({\eta }^{*}=-0.186\) for the in-control ARL value of 200.
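As a quick numerical check (our own SciPy computation, not part of the original study), the reported censoring time and CEV follow from the stated gamma parameters and censoring rate.

```python
from scipy.stats import gamma

a0, b0, Pc = 5.72, 0.48, 0.8
c_T = gamma.ppf(1 - Pc, a0, scale=b0)                                           # approximately 1.76 h
Cev = a0 * b0 * gamma.sf(c_T, a0 + 1, scale=b0) / gamma.sf(c_T, a0, scale=b0)   # approximately 3.09
print(round(c_T, 2), round(Cev, 2))
```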

In line with the above, practitioners tested five units from each batch under conditions of 70 °C and 80% relative humidity and halted testing when the test time reached 1.76 h. The EWMA statistics (Eq. (4)) were inputted into the trained CNN network to predict the statistic of the next period. Table 7 shows the lifetime data of the tested units for 30 batches. In the 101st batch, only one tested unit failed, at 1.23 h, and the other four tested units did not fail; the lifetimes of the four unfailed units were recorded as the CEV value. The actual value \({E}_{j}\) in Table 7 was obtained using Eq. (4). As shown in Table 7, the EWMA CEV statistic of the 101st batch was 2.75, so the practitioners inputted 2.75 into the trained CNN network, which outputted a predicted value of 2.82 for the 102nd batch. After the 102nd batch was produced and the life tests of its five units reached the termination time of 1.76 h, the actual value \({E}_{j}\) of this batch was determined to be 2.75 using Eq. (4), and the error value of this batch was 2.75 − 2.82 = − 0.07.

Table 7 Predicted and error values of the CNN network for the LCM lifetime test.

The other error values in Table 7 were obtained with the same method. Figure 3a plots these error values for the CNN-based control chart. The error values of batches 119–130 fall below the \({\eta }^{*}\) value, so this chart signals the variation at the 119th batch. Figure 3b shows the EWMA CEV chart with LCL = 2.56, whose in-control ARL value is also approximately 200. The EWMA CEV chart signals the variation at the 121st batch, detecting the same variation more slowly than the CNN-based EWMA chart with CEV.

Figure 3 Control charts for monitoring the LCM’s censored data.

Conclusions

The combination of deep learning methods and control charts has greatly improved the efficiency of process monitoring. However, the poor efficiency of control charts in monitoring highly censored type-I data remains a challenge for practitioners. This study proposed control charts based on deep learning methods with the EWMA CEV and CM statistics to detect mean lifetime reduction for gamma type-I censored data, and measured the ZS and SS ARL values of the proposed charts. Comparing the ZS and SS ARL values of the EWMA chart and the two deep learning-based EWMA charts with CEV and CM, the CNN-based EWMA chart outperforms the other control charts under the ZS condition. Under the SS condition, the EWMA charts based on CNN with CEV and CM outperformed the other charts for most shift sizes across various skewness coefficients and censoring rates, while the EWMA charts with CEV and CM were slightly better than the CNN-based EWMA charts for a few tiny shift sizes. The EWMA charts based on LSTM with CEV and CM consistently had the worst performance under both the ZS and SS conditions. In addition, a real-world case study showed that the CNN-based EWMA chart detected the mean lifetime reduction more efficiently than the traditional EWMA CEV chart.

For gamma censored data with lower censoring rates or smaller skewness coefficients, the CNN-based EWMA chart with CEV is the best choice, and the CNN-based EWMA chart with CM is recommended for monitoring moderately and highly censored data from heavily skewed gamma distributions. Future work could extend the current approach by combining CUSUM CEV and CM statistics with deep learning methods to monitor censored data with normal or non-normal distributions. In addition, there are opportunities to combine multiple deep learning methods to build control charts.