Performance of T²-based PCA mix control chart with KDE control limit for monitoring variable and attribute characteristics

Ahsan, Muhammad; Mashuri, Muhammad; Prastyo, Dedy Dwi; Lee, Muhammad Hisyam

doi:10.1038/s41598-024-58052-4

Article
Open access
Published: 28 March 2024

Scientific Reports volume 14, Article number: 7372 (2024)

Abstract

This work presents a detailed performance evaluation of the mixed multivariate T² control chart based on PCA Mix. The control limit of the proposed chart is calculated using the kernel density approach. Through simulation studies, the chart’s performance is assessed in terms of its capacity to identify outliers and process shifts. When 30% more outliers are included in the data, the proposed chart provides a consistent accuracy rate for identifying mixed outliers. For balanced proportions of attribute characteristics, misdetection happens because of the high false alarm rate. For unbalanced attribute characteristics and excessive proportions, the masking effect is the key issue. The proposed chart shows improved performance in identifying shifts in the process.

Introduction

Statistical process control (SPC) is a statistical methodology for monitoring and controlling the variation of a process to ensure that it produces products that meet customer requirements. The control chart, which is part of SPC, is one of the tools most often used to monitor the quality of a company’s products and services¹. Based on the number of monitored quality characteristics, control charts are divided into two types: univariate and multivariate. Univariate control charts monitor only one quality characteristic, while multivariate control charts monitor more than one quality characteristic.

In the current industrial era 4.0, a process is expected to be monitored on more than one type of quality characteristic. For example, a variable control chart is used to monitor variable characteristics (measured on a numerical scale, such as height or weight). Meanwhile, attribute control charts are employed to monitor categorical or attribute data (such as color or hardness)². Monitoring mixed quality characteristics in the manufacturing process is important³. However, in the past the monitoring of mixed quality characteristics was commonly conducted separately for each type. This is inefficient because two statistics and two control limits must be calculated. Moreover, the administrator will have difficulty interpreting the monitoring result if the two procedures yield different results. Therefore, a new concept for monitoring mixed characteristics is urgently needed.

Ahsan et al.⁴ proposed a new monitoring procedure based on the PCA Mix algorithm to overcome this issue. That work was also extended to detecting outliers for various numbers of contaminating outliers⁵. In this method, the T² statistic is used to form the control chart. Because the distribution of the statistic is unknown, the control limit of the PCA Mix chart is estimated using the kernel density, a non-parametric method for estimating the empirical density of an unknown distribution⁶. However, in that work, the performance of the PCA Mix chart in detecting outliers was evaluated for only one categorical (attribute) characteristic. Additionally, the chart’s ability to detect a process shift was evaluated only for shifts in both variable and attribute characteristics together. As a result, there is no recommendation regarding the kind of shift for which this chart performs best.

For these reasons, this work evaluates in detail the performance of the PCA Mix chart in detecting outliers and shifts in the process. As in the PCA Mix chart proposed by Ahsan et al.⁴, the proposed chart employs kernel density estimation (KDE) to calculate the control limit. In detecting outliers, the proposed chart is evaluated for more than one attribute characteristic. In monitoring process change, the proposed chart is evaluated for different kinds of shift and different correlations. This work also shows how the proposed chart is used to monitor actual data and how its performance compares.

The remainder of this work is structured as follows. Sect. “Related works” reviews the related research. Sects. “PCA mix” and “Charting procedures” present the PCA Mix method and the charting procedures of the proposed chart. Sects. “Performance in detecting outlier” and “Performance evaluation in monitoring process shift” present the performance assessments for identifying outliers and process shifts. Sect. “Application in the real cases” illustrates how the proposed chart is used to monitor actual datasets. Finally, Sect. “Conclusions” summarizes the conclusions.

Related works

Recent advancements in control charts are discussed in this section. Three categories of control charts are covered: multivariate variable charts, attribute charts, and mixed charts. For multivariate variable charts, the main emphasis is on three chart types: Hotelling’s T², multivariate EWMA, and multivariate CUSUM. Recent developments in these three multivariate variable charts are summarized in Table 1. Table 2 lists the most recent works on attribute charts. That table shows that current research has mostly concentrated on attribute charts using fuzzy, Poisson, and multinomial data.

Table 1 Multivariate variable chart’s most recent advancement.

PCA mix control chart’s procedures
Step 1 Input the variable data X₁ and the attribute data X₂
Step 2 Calculate the mixed principal component scores (PCs), denoted as \(\mathbf{Y}^{mix}\), from X₁ and X₂ using the PCA Mix method
Step 3 Take the first l components and calculate \(\tilde{T}_{i}^{2} = \sum_{v=1}^{l} \frac{(y_{i,v}^{mix} - \tilde{\mu}_{v})^{2}}{\lambda_{mix,v}}\), where \(\lambda_{mix,v}\) is the eigenvalue of the v-th PC
Step 4 Calculate the empirical density of the \(\tilde{T}_{i}^{2}\) statistics, \(\hat{f}_{h}(\tilde{T}^{2}) = \frac{1}{n\hat{h}} \sum_{i=1}^{n} k\left(\frac{\tilde{T}^{2} - \tilde{T}_{i}^{2}}{\hat{h}}\right)\), where \(k(\cdot)\) is the kernel function and \(\hat{h}\) is the optimum bandwidth calculated using the Botev, Grotowski, and Kroese algorithm³⁷
Step 5 Calculate the distribution function of the \(\tilde{T}_{i}^{2}\) statistics, \(\hat{F}_{h}(\tilde{t}) = \int_{0}^{\tilde{t}} \hat{f}_{h}(\tilde{T}^{2})\, d\tilde{T}^{2}\)
Step 6 Calculate the KDE control limit \(\widetilde{CL} = \hat{F}_{h}^{-1}(1 - \alpha)\) for a chosen false alarm rate \(\alpha\), when the process is in control
Step 7 Plot the \(\tilde{T}_{i}^{2}\) statistics along with the KDE control limit \(\widetilde{CL}\) to form the PCA Mix control chart
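
Steps 3 to 6 of the procedure can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the PC scores are simulated rather than produced by PCA Mix, the eigenvalues are hypothetical, and SciPy's Gaussian KDE with its default Scott's rule bandwidth stands in for the Botev, Grotowski, and Kroese bandwidth selector. The function names `t2_statistics` and `kde_control_limit` are also hypothetical.

```python
import numpy as np
from scipy.optimize import brentq
from scipy.stats import gaussian_kde


def t2_statistics(scores, mean, eigenvalues):
    """Step 3: sum over retained PCs of (y_iv - mu_v)^2 / lambda_v."""
    return np.sum((scores - mean) ** 2 / eigenvalues, axis=1)


def kde_control_limit(t2, alpha=0.01):
    """Steps 4-6: estimate the density of T2 and invert its CDF at 1 - alpha."""
    # Assumption: Gaussian kernel with Scott's rule bandwidth, used here in
    # place of the Botev, Grotowski, and Kroese selector named in Step 4.
    kde = gaussian_kde(t2)
    # Integrate from -inf rather than 0 so the estimated CDF reaches 1;
    # a Gaussian kernel places a little mass below zero.
    cdf = lambda t: kde.integrate_box_1d(-np.inf, t)
    bandwidth = kde.factor * np.std(t2, ddof=1)
    upper = t2.max() + 10.0 * bandwidth  # the CDF is essentially 1 here
    return brentq(lambda t: cdf(t) - (1.0 - alpha), 0.0, upper)


# Hypothetical in-control data: 1000 observations on l = 3 retained PCs.
rng = np.random.default_rng(0)
eigenvalues = np.array([2.0, 1.0, 0.5])
scores = rng.normal(size=(1000, 3)) * np.sqrt(eigenvalues)
mean = scores.mean(axis=0)

t2 = t2_statistics(scores, mean, eigenvalues)
limit = kde_control_limit(t2, alpha=0.01)

# Step 7 (without the plot): flag a new observation whose first PC has shifted.
new_scores = np.array([[15.0, 0.0, 0.0]])
out_of_control = t2_statistics(new_scores, mean, eigenvalues) > limit
```

The control limit is obtained by root finding on the estimated distribution function, which mirrors the inversion in Step 6. With real data, `scores` and `eigenvalues` would come from the PCA Mix decomposition in Step 2, and the bandwidth choice may change the limit noticeably for small samples.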
