Introduction

Shockable rhythms, including ventricular fibrillation (VF) and ventricular tachycardia (VT), are abnormal waveforms of the electrocardiogram (ECG) signal and the main cause of sudden cardiac arrest (SCA), which results in death if life-support services are not provided to the patient immediately. To date, the automated external defibrillator (AED) associated with a shock advice algorithm (SAA) is the most effective device for rapid diagnosis of the SCA and delivery of a countershock to reset the electrical system of the heart1.

Recently, research on SCA diagnosis using intelligent technologies has focused largely on improving SAA performance. Indeed, a rhythm incorrectly detected as non-shockable leaves a patient under SCA without treatment, while a rhythm incorrectly detected as shockable triggers a defibrillation that can itself induce cardiac arrest. Machine learning (ML)1,2,3,4,5,6,7,8,9,10 and deep learning (DL)11,12,13,14,15,16 in combination with advanced signal processing techniques have been widely proposed for SAA design to improve AED performance. Generally, the diagnosis performance of these SAAs is better than that of conventional methods, which use different thresholds for SCA classification10, and meets the American Heart Association (AHA) recommendation17. The DL-based SAA designs show various advantages over the ML-based SAAs, such as requiring no feature extraction, score-based feature ranking, or feature selection, and learning better representative features, which reduces the complexity of the SAA design process12. However, identifying the optimal DL model is a time-consuming procedure due to the optimization of different hyper-parameters and structures12,13,14,16. Furthermore, DL models require massive data, which makes data collection difficult. Nevertheless, the ML techniques have proven their effectiveness for SAA designs in terms of model construction with small amounts of data, better adaptation to binary classification, and short model optimization time2,3,5,6,10.

To improve the classification performance of SAA designs using ML or DL methods, different algorithms have been used for signal processing, such as ensemble empirical mode decomposition (EEMD)4, discrete wavelet transform5, variational mode decomposition (VMD)6,8,10,12, Taylor Fourier transform7, stationary wavelet transform9, and fixed frequency range empirical wavelet transform (FFREWT)13. The rationale behind these techniques is to increase the ECG signal quality, which results in better-extracted features. Indeed, the subsignals generated from the original ECG signals by the above techniques properly contain the non-shockable and shockable components of the original ECG segments. Therefore, input features extracted from the subsignals represent the non-shockable and shockable components better than those extracted from the original ECG segments. Moreover, feature selection methods such as the genetic algorithm1, differential evolution algorithm2, Gaussian genetic algorithm3, and correlation attribute evaluation4 are employed to identify the most relevant input features. In other words, the main responsibility of the feature selection algorithms is the identification of optimal features that are certainly uncorrelated. The difference in diagnosis performance between the ML- and DL-based SAA designs is negligible when the advanced signal processing techniques are used to improve the quality of the extracted features, as shown in Refs.5,6,7,8,9,10,12,13. In this paper, we refer to the application of advanced signal processing techniques for achieving better feature quality as ECG signal transformation with respect to SAA design.

Principal component analysis (PCA) is a well-known method for extracting robust features from non-stationary ECG signals, which are associated with rapid changes in waveform properties. Indeed, the correlation between input features plays an essential role in the selection of the most informative features, and the PCA offers a robust approach to feature estimation based on their correlation degrees18. Particularly, the input features are transformed by the PCA into another space, in which the principal components are ordered according to their correlation degrees. Consequently, the feature representation becomes a subset of components that are highly uncorrelated with one another.

This work is motivated by the observation that the principal components transformed from the input features by the PCA can serve as alternative input features, possibly contributing to the improvement of the proposed SAA design. Moreover, such feature transformation, in which the principal components are used as alternative inputs to the ML and DL algorithms, has not been considered properly in previous works on shockable/non-shockable rhythm classification. In this paper, a novel SAA design is proposed for SCA detection from the ECG signals using a K-nearest neighbor (KNN) model and a component feature combination (CFC). First, a feature subset is selected carefully from the input features by the recursive feature elimination (RFE) method, using the feature importance computed by a Boosting (BS) model and a wise-patient fivefold cross-validation (CV) procedure. Second, the feature transformation is implemented by the PCA to construct the CFCs, which are then used for model optimization on the training set. Last, the diagnosis performance of the ML model using the CFCs is estimated by the wise-patient fivefold CV method on the validation set. The main contributions of this work are as follows:

  • Evaluation of the feature correlation by means of feature importance in the original feature space, and generation of highly uncorrelated features by the PCA transformation to construct the alternative CFC in the principal component space.

  • The use of average importance values, computed repeatedly by the RFE algorithm with a BS model in combination with the CV method, leading to a reliable estimation for the feature selection.

  • The proposal of a simple and effective SAA for the AED, which achieves relatively high performance with respect to SCA detection.

Data and preprocessing

The public databases considered for method development and validation in this study are the Creighton University Ventricular Tachyarrhythmia Database (CUDB) and the MIT-BIH Malignant Ventricular Arrhythmia Database (VFDB)19. The CUDB and VFDB include 35 single-channel and 22 double-channel records with record lengths of 8 and 35 min, respectively. ECG signals annotated as VF, VT, and ventricular flutter are treated as shockable signals, whose bandwidth ranges from 0 to 10 Hz10, while non-shockable signals contain other types of ECG signals such as normal sinus, paced, and nodal rhythms, atrial fibrillation, and ventricular ectopic beats. For a better learning process, only the first channel of the VFDB is employed in this work. The sampling frequency of the databases is 250 Hz, and the total of 57 records is divided into non-overlapping 8-s segments of the ECG signals. Because they make no contribution to SCA diagnosis, noise, asystole, transition rhythms, slow VT under 150 beats per minute among the intermediate rhythms, and VF rhythms with peak-to-peak amplitude under 200 \(\upmu\)V are eliminated from the ECG records12. The removal of asystole, which is considered a non-shockable rhythm, based on the zero amplitude of the ECG signals provides appropriate conditions for the SAA. As a result, 1135 shockable and 5185 non-shockable segments are used in this work. Moreover, wise-patient separation, also known as the inter-patient paradigm, is implemented to divide the records coming from different patients into training and validation sets. Indeed, 40 and 17 records collected from individual patients, which account for 70% and 30% of the entire databases and correspond to 4303 and 2017 segments, are assigned to the training and validation sets, respectively. The records and the shockable and non-shockable ECG segments of the CUDB and VFDB databases divided into training and validation sets are given in Table 1. Figure 1 shows shockable and non-shockable ECG segments, which are preprocessed as follows:

  (i) Generation of the smooth ECG signal by fifth-order moving average filtering.

  (ii) Removal of drift and baseline wander by high-pass filtering with a 1 Hz cutoff frequency.

  (iii) Elimination of high-frequency interference by second-order 30 Hz low-pass Butterworth filtering; a sketch of this chain is given after the list. It is noteworthy that the existing AED supports a bandwidth of 1–30 Hz for monitor-type ECG6.
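A minimal sketch of this preprocessing chain, assuming SciPy; the moving-average window and the high-pass filter order are interpretations noted in the comments, not details stated in the text:

```python
import numpy as np
from scipy.signal import butter, filtfilt

FS = 250  # sampling frequency of the databases (Hz)

def preprocess(segment):
    """Clean one raw 8-s ECG segment following steps (i)-(iii)."""
    # (i) moving average smoothing; the fifth-order filter is interpreted
    #     here as a 5-sample window (an assumption of this sketch)
    smooth = np.convolve(segment, np.ones(5) / 5, mode='same')
    # (ii) 1 Hz high-pass against drift and baseline wander
    #      (filter order is not stated in the text; first order assumed)
    b, a = butter(1, 1.0, btype='highpass', fs=FS)
    no_drift = filtfilt(b, a, smooth)
    # (iii) second-order 30 Hz low-pass Butterworth against
    #       high-frequency interference
    b, a = butter(2, 30.0, btype='lowpass', fs=FS)
    return filtfilt(b, a, no_drift)
```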

Table 1 Number of records, shockable and non-shockable ECG segments in training and validation sets.
Figure 1 Non-shockable and shockable 8-s ECG segments.

Method development

The proposed method, shown in Fig. 2, includes three main stages. First, feature extraction is implemented to collect the input features in the time and frequency domains; here, the original ECG signals are divided into segments, which are then preprocessed by the techniques above. Second, the recursive feature elimination algorithm is performed to construct different feature subsets, i.e., input feature subsets (IFSs), using the importance values computed by the BS model; each feature subset is then transformed into a CFC in the principal component domain, where model selection and SCA classification are completed. Last, the wise-patient fivefold CV procedure is employed to validate the selected ML and DL models using the CFCs on the validation set. The feature subset whose CFC, adopted as the input of a classifier, produces the highest validated diagnosis performance in terms of accuracy is chosen for the proposed SAA design. All the principal components of the CFCs corresponding to the IFSs are used as the input of the ML and DL models. In addition, relatively high diagnosis accuracy is essential to ensure the reliability of the proposed algorithm for application in the clinical environment; therefore, we propose an accuracy criterion of greater than 99% for the selection of the proposed SAA.

We use six ML and two DL algorithms in this work: KNN, support vector machine (SVM), Boosting (BS), Bagging (BG), random forest (RF), logistic regression (LR), convolutional neural network (CNN), and long short-term memory (LSTM)20. It is noteworthy that t-SNE is not considered as a classification model in this work because its outcomes are unstable and unlabeled, which makes it unsuitable for simulation with the CV method.

Figure 2 Flow of method development.

Feature extraction

The input features are extracted from the preprocessed ECG segments by applying various conventional techniques collected from a large number of existing publications. These features, also known as threshold algorithms, are the most commonly used for shockable/non-shockable rhythm classification. Indeed, each feature originates from an algorithm that calculates a threshold used to distinguish shockable from non-shockable ECG segments. However, the thresholds are unnecessary when the outputs of the corresponding algorithms are used as the input features of the ML or DL methods10. Moreover, for practical SCA diagnosis, the contribution of each feature is observed only through its correlation within a complete feature combination. A total of 31 features are extracted from the 8-s ECG segments, including temporal, spectral, and complexity features10, as follows:

  • Temporal features: mean absolute value (MAva), bCP, threshold crossing sample count (TCSC), threshold crossing interval (TCin), modified exponential algorithm (MEal), standard exponential algorithm (SEal), Count1, Count2, and Count3.

  • Spectral features: center frequency (CFre), spectral analysis (A1 and A2), power spectral analysis (PSan), center power (CPow), Y\(\_\)Li, bWT, bW, VF-filter leakage measure (VFLM).

  • Complexity features: Hilbert transform (HTra), covariance calculation (CCal), phase space reconstruction (PSre), area calculation (ACal), frequency calculation (FCal), Kurtosis (Kurt), complexity measure (CMea), dispersion entropy (DEnt), sample entropy (SEnt), energy (Ener), Renyi entropy (REnt), fuzzy entropy (FEnt), and wavelet entropy (WEnt).

The mean values of the features calculated for 50 non-shockable and 50 shockable ECG segments differ markedly, as shown in Table 2. The 31 features can be categorized into two groups according to the differences between the average feature values computed for the shockable and non-shockable ECG segments: 17 features produce large differences and 14 features produce small differences. Intuitively, a larger difference between the values indicates a greater capability of the feature to recognize shockable versus non-shockable ECG segments.

Table 2 Mean values of features for non-shockable and shockable ECG segments.

Feature and model construction

Feature construction

In this stage, we use the RFE method to construct various feature subsets, which are then transformed into the component space by the PCA. First, the input features are ranked by their importance values, computed by the BS model in combination with the wise-patient fivefold CV procedure; features with large average importance values are considered more important than the others. Then, the feature subsets are constructed by removing the features with the lowest average importance values. It is noteworthy that the BS model combined with the wise-patient fivefold CV procedure is implemented repeatedly to calculate the feature importance for every feature subset until no feature remains for removal. Algorithm 1 represents the RFE method for the construction of the different feature subsets.

Algorithm 1 The RFE method for the construction of different feature subsets.
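A minimal sketch of this recursive elimination, assuming scikit-learn's gradient boosting classifier as the BS model and a `groups` array carrying per-segment patient identifiers; the function and variable names are introduced here for illustration:

```python
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import GroupKFold

def rfe_subsets(X, y, groups, names):
    """Return the nested feature subsets produced by the RFE loop."""
    remaining = list(range(X.shape[1]))
    subsets = []
    while remaining:
        subsets.append([names[i] for i in remaining])
        if len(remaining) == 1:
            break  # no feature left for subtraction
        # average BS importances over the wise-patient fivefold CV splits
        importance = np.zeros(len(remaining))
        for train_idx, _ in GroupKFold(n_splits=5).split(X, y, groups):
            bs = GradientBoostingClassifier(random_state=0)
            bs.fit(X[train_idx][:, remaining], y[train_idx])
            importance += bs.feature_importances_ / 5
        # drop the feature with the lowest average importance
        remaining.pop(int(np.argmin(importance)))
    return subsets
```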
Feature importance

Given that n features, denoted \(F_{n}\), are included in the input data X, a relevance measure for each feature is computed as follows21:

$$\begin{aligned} R_{n}^2(Tr) = \sum \limits _{i=1}^{T-1} \hat{s}_{i}^2 I(v(i)=n), \end{aligned}$$
(1)

where I is the indicator function and v(i) denotes the variable chosen at internal node i. Tr is a single tree, which consists of \(T-1\) internal nodes. The region associated with a node v(i) is partitioned by an input variable \(F_n(v(i))\) into two subregions, in each of which a separate constant is fit to the response values. The chosen variable is the one that gives the maximal estimated improvement \(\hat{s}_i^2\) in squared error risk over fitting a single constant to the entire region. The importance of variable \(F_{n}\) is the sum of these improvements over all internal nodes at which it is chosen.

For the calculation of \(\hat{s}_i^2\), consider a system with a random output (response) variable y and input observations \(x = \{x_1, x_2,..., x_K\}\). \(p_t(x) = P(y_t|x)\) is the conditional probability of the response given x, where \(y_{t} \in \{0, 1\}\), \(t \in \{t_{1}, t_{2}\}\) are the binary classes, and \(k \in \{1 : K\}\). L and P are the numbers of trees and iterations, respectively. Each tree has \(T-1\) internal nodes corresponding to regions \(\{Z_{ltp}\}_{l=1}^{L}\). L trees are built at each iteration p to predict the corresponding residuals for each class t on the probability scale. The maximal estimated improvement \(\hat{s}_i^2\) is used to estimate the split of a region \(Z_{ltp}\) into two subregions corresponding to classes \(t_{1}\) and \(t_{2}\) as follows:

$$\begin{aligned} \hat{s}_i^{2}(t_{1},t_{2})=\frac{w_{t_{1}}w_{t_{2}}}{w_{t_{1}}+w_{t_{2}}} (\overline{y}_{t_{1}}-\overline{y}_{t_{2}})^{2}, \end{aligned}$$
(2)

where pseudoresponses \(\overline{y}_{t_{1}}\) and \(\overline{y}_{t_{2}}\) are the means of left and right daughter responses. \(w_{t_{1}}\) and \(w_{t_{2}}\) are the corresponding sums of weights. These are computed with respect to observation \(x_{k}\) as follows:

$$\begin{aligned} \overline{y}_{kt}=\frac{L-1}{L}\frac{y_{kt}-p_{t}(x_{k})}{p_{t}(x_{k})(1-p_{t}(x_{k}))}, \end{aligned}$$
(3)
$$\begin{aligned} w_{t}(x_{k})=p_{t}(x_{k})(1-p_{t}(x_{k})). \end{aligned}$$
(4)

The importance measure is simply averaged over the trees, based on the generalization of the additive tree expansions, as follows:

$$\begin{aligned} R_{n}^2 =\frac{1}{L} \sum \limits _{l=1}^{L} R_{n}^2(Tr_{l}). \end{aligned}$$
(5)

For binary classification, two separate models \(M_{1}(x)\) and \(M_{2}(x)\) are generated in which each model contains a sum of trees as follows:

$$\begin{aligned} M_{1}(x) =\sum \limits _{l=1}^{L} Tr_{1l}(x), \end{aligned}$$
(6)
$$\begin{aligned} M_{2}(x) =\sum \limits _{l=1}^{L} Tr_{2l}(x). \end{aligned}$$
(7)

Then, Eq. (5) is generalized for the binary classes as follows:

$$\begin{aligned} R_{n1}^2 =\frac{1}{L} \sum \limits _{l=1}^{L} R_{n}^2(Tr_{1l}), \end{aligned}$$
(8)
$$\begin{aligned} R_{n2}^2 =\frac{1}{L} \sum \limits _{l=1}^{L} R_{n}^2(Tr_{2l}). \end{aligned}$$
(9)

The final relevance of \(F_{n}\) is computed as the mean of two classes as follows:

$$\begin{aligned} R_{n}^2 =\frac{1}{2}(R_{n1}^2+R_{n2}^2). \end{aligned}$$
(10)

Here, the use of binary classification yields relevances of \(F_{n}\) related to the observations of each class separately. Hence, \(R_{n1}\) and \(R_{n2}\) are the relevances of \(F_{n}\) corresponding to the two classes.

Obviously, the importance measures are relative, and the respective square roots of Eqs. (1) and (5) are taken. Therefore, the largest importance value is customarily assigned a value of 100, and the other values are scaled accordingly.
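As a minimal illustration of this convention, assuming the squared relevances of Eq. (10) are collected in an array:

```python
import numpy as np

def scale_importance(r_squared):
    """Scale squared relevances (Eq. 10) so the largest importance is 100."""
    r = np.sqrt(np.asarray(r_squared, dtype=float))  # take the square roots
    return 100.0 * r / r.max()                       # largest value becomes 100
```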

Principal component analysis

The different feature subsets, generated by the RFE using the BS model and the wise-patient fivefold CV, are transformed into the component space by the PCA, in which each feature subset corresponds to a CFC. Obviously, the high correlation between features in the original feature subset is the main motivation for using the PCA transformation to obtain a component subset whose individual components are significantly uncorrelated.

More precisely, the training set with n features, known as the observed variables \(F_{n}\), is transformed into a different space of n principal components \(C_{n}\), which are independent and uncorrelated variables. First, the features are standardized to make them independent of the measurement scale. Then, a feature is given as follows:

$$\begin{aligned} F_{m} = a_{m1}C_{1}+a_{m2}C_{2}+ \cdots +a_{ml}C_{l}+ \cdots +a_{mn}C_{n}. \end{aligned}$$
(11)

Also, the principal components, which are used as the CFC for further model selection and feature estimation, are computed as a linear combination of the original features as follows:

$$\begin{aligned} C_{l} = b_{1l}F_{1}+b_{2l}F_{2}+ \cdots +b_{ml}F_{m}+ \cdots +b_{nl}F_{n}. \end{aligned}$$
(12)

The components \(C_{l}\) are uncorrelated with each other and ordered by the sample variance \(\lambda\); that is, the largest sample variance corresponds to \(C_{1}\), the second largest to \(C_{2}\), and so on. The sample variances of the different components are known as eigenvalues, which show their share of the total variance. In detail, a component carrying a larger share of the total variance of the original features, represented by a larger eigenvalue, is more important than the others with smaller eigenvalues. The covariance matrix Cov of the original features F is as follows:

$$\begin{aligned} Cov=\begin{bmatrix} 1 & x_{12} & \cdots & x_{1j} & \cdots & x_{1n}\\ x_{21} & 1 & \cdots & x_{2j} & \cdots & x_{2n}\\ \vdots & \vdots & & \vdots & & \vdots \\ x_{i1} & x_{i2} & \cdots & x_{ij} & \cdots & x_{in}\\ \vdots & \vdots & & \vdots & & \vdots \\ x_{n1} & x_{n2} & \cdots & x_{nj} & \cdots & 1 \end{bmatrix}, \end{aligned}$$
(13)

where \(x_{ij}\) is the correlation of \(F_{i}\) and \(F_{j}\). The coefficients \(b_{ml}\) with \(m = 1, \ldots, n\) for component \(C_{l}\) form the eigenvector corresponding to the \(l{\text {th}}\) largest eigenvalue \(\lambda _{l}\). Due to the standardization, the total variance of all features equals the number of features as follows:

$$\begin{aligned} n = \lambda _{1} + \lambda _{2} + \cdots + \lambda _{n}. \end{aligned}$$
(14)

Hence, \(\lambda _{l}/n\) is the proportion of the total variance carried by the \(l{\text {th}}\) component. It is noteworthy that retaining the same number of components as features preserves the entire information during the transformation process.
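A minimal sketch of this transformation, assuming scikit-learn; the IFS matrices below are random placeholders for the extracted features, not real data:

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Placeholder IFS matrices (segments x 8 features); in the paper these would
# hold the selected features extracted from the training and validation segments.
X_train_ifs = np.random.randn(4303, 8)
X_valid_ifs = np.random.randn(2017, 8)

# Standardize, then rotate into the principal component space; keeping all
# components preserves the entire information during the transformation (Eq. 14).
cfc = make_pipeline(StandardScaler(), PCA(n_components=None))
cfc_train = cfc.fit_transform(X_train_ifs)  # fit the rotation on training data only
cfc_valid = cfc.transform(X_valid_ifs)      # reuse it on the validation set

# Eigenvalue shares (lambda_l / n), ordered from largest to smallest
print(cfc.named_steps['pca'].explained_variance_ratio_)
```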

Model selection

All the CFCs are fed into the different ML and DL models to search for the optimal learning and structure parameters on the training set. Obviously, hyper-parameter tuning is necessary to identify the optimal models and contributes significantly to avoiding the overfitting problem. Furthermore, the classification performance of the selected models corresponding to the CFCs is estimated on the validation set. In this work, we use grid search in combination with the wise-patient fivefold CV method to identify the optimal parameter values for the models using all the CFCs.
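A minimal sketch of this selection step for the KNN model, assuming scikit-learn and random placeholder arrays for the training CFC, labels, and patient identifiers; the K range follows the one reported later for the proposed SAA:

```python
import numpy as np
from sklearn.model_selection import GridSearchCV, GroupKFold
from sklearn.neighbors import KNeighborsClassifier

# Placeholder training data: CFC matrix, labels, and per-segment patient IDs.
cfc_train = np.random.randn(4303, 8)
y_train = np.random.randint(0, 2, 4303)
patients_train = np.random.randint(0, 40, 4303)

# Grid search with wise-patient fivefold CV: records of one patient never
# appear in both the training and testing folds of a split.
grid = GridSearchCV(
    KNeighborsClassifier(),
    param_grid={'n_neighbors': list(range(5, 101))},  # K searched from 5 to 100
    scoring='accuracy',
    cv=GroupKFold(n_splits=5),
)
grid.fit(cfc_train, y_train, groups=patients_train)
print(grid.best_params_)  # the paper reports K = 15 as optimal
```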

Feature estimation

The optimal models, which are the output of the model selection phase, are estimated for their detection performance on the validation set using the CFCs. In addition, the wise-patient fivefold CV procedure is implemented to obtain reliable simulation results. The procedure comprises two steps, data separation and model validation. In the former, the validation set is separated into 5 subsets of records, in which one subset is the testing data while the others are used for training the models. In the latter, training and testing of the models are repeated 5 times so that every subset becomes the testing data once. The wise-patient CV procedure is run 30 times to compute the mean and standard deviation of the classification performance. The final feature subset is the one whose corresponding CFC, used as the input of a model, produces the highest detection accuracy among all candidates. The SAA design including this final feature subset and the above model is proposed for the practical AED.
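The repeated wise-patient fivefold CV could be sketched as follows; the `groups` array of per-segment patient/record identifiers and the random fold shuffling are assumptions of this sketch:

```python
import numpy as np
from sklearn.base import clone

def repeated_patientwise_cv(model, X, y, groups, repeats=30, seed=0):
    """Wise-patient fivefold CV repeated `repeats` times; mean/std accuracy."""
    rng = np.random.default_rng(seed)
    patients = np.unique(groups)
    accuracies = []
    for _ in range(repeats):
        # shuffle the patients' records into 5 record-level folds
        folds = np.array_split(rng.permutation(patients), 5)
        for held_out in folds:
            test_mask = np.isin(groups, held_out)  # one subset is testing data
            fitted = clone(model).fit(X[~test_mask], y[~test_mask])
            accuracies.append(fitted.score(X[test_mask], y[test_mask]))
    return float(np.mean(accuracies)), float(np.std(accuracies))
```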

Results

Performance measure

A number of measures, namely accuracy (Ac), sensitivity (Se), specificity (Sp), and balanced error rate (BER), are used for the performance estimation of the ML models using the CFCs. The Ac measures the proportion of ECG segments identified correctly. The proportions of shockable and non-shockable ECG segments identified correctly are represented by the Se and Sp, respectively. The BER is computed as 1 − 0.5(Se + Sp).
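A minimal sketch of these measures computed from a confusion matrix, with shockable as the positive class:

```python
from sklearn.metrics import confusion_matrix

def saa_metrics(y_true, y_pred):
    """Ac, Se, Sp, and BER with shockable as the positive class (label 1)."""
    tn, fp, fn, tp = confusion_matrix(y_true, y_pred, labels=[0, 1]).ravel()
    ac = (tp + tn) / (tp + tn + fp + fn)  # accuracy
    se = tp / (tp + fn)                   # sensitivity: shockable found correctly
    sp = tn / (tn + fp)                   # specificity: non-shockable found correctly
    ber = 1 - 0.5 * (se + sp)             # balanced error rate
    return ac, se, sp, ber
```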

Figure 3 t-SNE visualization of the validation set with a CFC transformed from an IFS of 8 features.

Feature and model construction

Feature construction

A total of 31 input features, extracted from the preprocessed ECG segments, are adopted to construct 31 feature subsets with between 1 and 31 features using the RFE method. Moreover, the BS model and wise-patient fivefold CV procedure included in the RFE algorithm are applied 30 times to produce 30 tables of importance values for the individual feature subsets. Then, the features with the lowest importance values in these tables are removed. All the feature subsets are transformed into the component space by the PCA to generate 31 CFCs. Figure 3 shows the t-SNE visualization of the validation set with the CFC corresponding to the IFS of 8 features. The individual input features are ranked by the RFE algorithm as shown in Table 3, where the Imp number represents how many times each feature is selected by the RFE method.

Table 3 Individual features ranked by the RFE method.

Model selection

A total of 8 ML and DL algorithms and 31 feature subsets corresponding to 31 CFCs are considered in this work. Hence, there are a total of 248 models, which are optimized for the individual feature subsets using the grid search method in combination with the wise-patient fivefold CV procedure.

Feature estimation

The diagnosis performance of the 248 models using the CFCs is then estimated on the validation set by the wise-patient fivefold CV procedure. Table 4 shows the 8 models that produce the highest detection accuracy and the number of features in the corresponding IFSs. Particularly, each model in Table 4 is chosen by comparing the classification accuracy of 31 models using the 31 CFCs transformed from the 31 IFSs by the PCA.

Table 4 The highest detection performance of the individual models using as input the CFCs, which are transformed from IFSs with different numbers of original features.

Proposed SAA

The SAA design proposed for the AED in this work contains a KNN model (K = 15) and a subset of 8 features extracted from ECG signals with a length of 8 s. Here, the KNN model combined with the CV procedure is implemented repeatedly with different K values ranging from 5 to 100, and the value of K for which the KNN model produces the highest detection accuracy is selected as the optimal parameter. The proposed SAA is installed on the AED to detect the SCA as follows:

  (i) Collection of an ECG segment with a length of 8 s using a sliding window and the electrode pads of the AED.

  (ii) Implementation of the preprocessing techniques to obtain the clean ECG signal.

  (iii) Extraction of an IFS including 8 features from the clean ECG signal.

  (iv) Transformation of the above IFS into the principal component space by the PCA to obtain the CFC.

  (v) Assignment of a label to the ECG segment, 0 for non-shockable or 1 for shockable, by the trained KNN model using the CFC transformed from the IFS of 8 features; a sketch of this loop is given after the list.
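A minimal sketch of this on-device loop; `preprocess_segment` and `extract_ifs` are hypothetical stand-ins for steps (ii) and (iii), while `cfc` and `knn` denote the fitted PCA pipeline and trained KNN model:

```python
import numpy as np

FS = 250          # sampling frequency (Hz)
SEG_LEN = 8 * FS  # 8-s analysis window

def detect_sca(ecg_stream, preprocess_segment, extract_ifs, cfc, knn):
    """Yield 0 (non-shockable) or 1 (shockable) for consecutive 8-s segments."""
    for start in range(0, len(ecg_stream) - SEG_LEN + 1, SEG_LEN):
        segment = ecg_stream[start:start + SEG_LEN]          # (i) sliding window
        clean = preprocess_segment(segment)                  # (ii) preprocessing
        ifs = np.asarray(extract_ifs(clean)).reshape(1, -1)  # (iii) 8 features
        component_features = cfc.transform(ifs)              # (iv) PCA -> CFC
        yield int(knn.predict(component_features)[0])        # (v) KNN, K = 15
```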

The average duration of the segmentation, signal preprocessing, feature extraction, feature transformation, and classification consumed by the proposed SAA, calculated over 50 consecutive ECG segments, is \(3.97 \pm 2.28\) s. Certainly, this total time is shorter than the ECG segment length of 8 s, which ensures no interruption of the SCA diagnosis between consecutive ECG segments. According to the AHA recommendations, a minimal number of 3 ECG leads, also known as a 3-wire lead set, is employed for the ECG acquisition. Two electrodes, right arm (RA) and left arm (LA), are placed under the right and left clavicles on the mid-clavicular line within the rib cage frame. The last is the left leg (LL) electrode, which is placed on the lower left abdomen within the rib cage frame.

Discussion

Nowadays, the AED, which is convenient, low-cost, and reliable, is the most productive decision-support system for SCA treatment. Indeed, the shockable rhythms are diagnosed reliably from the ECG signals by the SAA integrated into the AED. Then, rapid defibrillation is delivered to restart the electrical system of the heart, which improves the chance of survival.

The feature correlation plays a vital role in estimating feature quality for the classification of shockable/non-shockable rhythms on the ECG signals. Indeed, the degree to which the information carried by the features overlaps is measured by the feature correlation or feature importance. Hence, the individual features should each contain distinct information that can be used to correctly discriminate the shockable from the non-shockable ECG segments.

Both the evaluation and the generation of feature correlation are considered for the feature selection, which consolidates the representativeness of the selected features in the final CFC. Basically, a feature is removed from the input features by the RFE method, using the importance value computed by the BS model in combination with the wise-patient fivefold CV method, to construct a new IFS. Here, the mean importance values are recalculated for every IFS to identify the feature with the lowest mean importance for elimination. This procedure aims at evaluating the correlation between the features in an IFS as the number of features changes. Consequently, the persistent features preserved after the repetitions of the RFE algorithm are highly uncorrelated with the others. Another significant characteristic is that the decorrelation of the features is implemented by the PCA transformation. Indeed, each CFC includes highly uncorrelated features, also known as the principal components, as shown in Eq. (12). According to this formulation, each principal component is constructed from different fractions of all the input features in a specific IFS. Obviously, the information related to the shockable/non-shockable rhythms is accumulated to form a new feature, namely the principal component. The high validated performance of the ML and DL algorithms in Table 4 shows the effectiveness of the proposed feature construction in this work. The CFC corresponding to 8 input features, in which the individual principal components contain distinct information for representing the shockable/non-shockable rhythms on the ECG segments, is significantly efficient as the input of the KNN model for SCA detection. Table 5 shows the performance comparison of the proposed SAA with existing algorithms for shockable/non-shockable (SH/NSH) rhythm classification.

Clearly, the number of selected features in the proposed SAA is relatively small in comparison with other publications, while high validated performance is maintained. Compared to the signal transformation methods used in4,8,10,13, the feature transformation used for the proposed SAA is simple and powerful. Indeed, the difference between the validated performance of our SAA and that of Ref.10, which is the highest given in Table 4, is negligible, while the proposed SAA design uses only 8 input features compared with the 36 features determined in Ref.10.

Table 5 Performance comparisons of the proposed SAA with existing works.

Furthermore, the clinical physiological meanings of the selected input features provide deeper knowledge of the shockable/non-shockable rhythms related to SCA diagnosis using the ECG signals. The Count2 feature is defined as the number of samples that represent clear QRS peaks of the ECG signals. The non-shockable ECG segments have clear QRS peaks while the shockable segments have none, which results in small and large sample counts for Count2, respectively22. The differences between the auto-correlation and cross-correlation of the ECG waveforms are computed as the Y\(\_\)Li feature using the continuous wavelet transform. Here, small differences are associated with the clear QRS complexes of the non-shockable ECG segments, while no complex is available on the shockable ECG signals, leading to large difference values23. An essential characteristic of the ECG signals is their regularity, which is measured by the SEnt feature. Large differences between samples frequently appear on the shockable ECG segments due to the absence of QRS complexes, which leads to a great SEnt value; in contrast, non-shockable ECG signals with a large number of clear QRS complexes indicate a high level of signal self-similarity, which results in a small SEnt value24. Given that the ECG signal is similar to a quasi-sinusoidal waveform, a version of the ECG segment is generated by shifting the signal by half a period. A combination of the original ECG segment and its shifted version forms the VFLM output, which takes small or high amplitude values for the shockable or non-shockable ECG segments, respectively25. The FCal feature is based on the complexity and disorganization of the ECG signals, which are first converted into binary signals. Well-organized ECG segments frequently stay close to the isoelectric line, which results in stable binary signals with low FCal values; conversely, rapid changes of the binary signals for the shockable ECG segments yield high FCal values26. Clear QRS complexes of the non-shockable ECG signals result in high values of the signal slope, represented by a large bCP; in contrast, the absence of a pronounced slope on the shockable ECG signal implies a small bCP value27. A broadband and a narrowband spectrum are assumed for the non-shockable and shockable ECG signals, respectively, and their frequency difference is measured by the bW feature; hence, large and small bW values correspond to non-shockable and shockable ECG segments, respectively27. The non-shockable ECG signals are close to the isoelectric line most of the time, while a large number of disorganized peaks appear on the shockable ECG segments; therefore, the Ener feature values are small and large for the non-shockable and shockable ECG segments, respectively28.

The limitations of our study are the time consumption and complexity of the RFE algorithm using the wise-patient fivefold CV procedure, which is run repeatedly to compute the importance values for feature elimination. In addition, the input data are relatively small, which possibly reduces the effectiveness of the proposed SAA on online ECG signals in clinical environments. Subsequently, the evaluation of the proposed SAA with massive and diverse ECG databases remains open for future work.

Conclusion

The chance of survival depends largely on the rapid and correct diagnosis and countershock delivery of the AED for SCA on the ECG signals. Moreover, the SAA is the most important element of the AED, playing the role of the decision-support algorithm. Hence, the performance improvement of SAA designs receives intensive attention from medical experts, as high classification performance results in the avoidance of numerous unexpected deaths.

In this work, we proposed an effective and simple SAA for the AED, a potential application for the clinical environment, using the ML technique. The proposed SAA was designed with a KNN model and a subset of 8 input features, which were carefully selected by the RFE method in combination with the BS model and the wise-patient fivefold CV procedure. The feature correlation was evaluated repeatedly over the entire set of input features using the average feature importance computed by the BS model to eliminate the individual features with the lowest importance values. Consequently, various IFSs with different feature numbers were constructed and then converted by the PCA into the CFCs, whose components are independent and uncorrelated variables. Obviously, the decorrelation of the IFSs was increased by the use of the CFCs as the input of the KNN model, which led to the relatively high diagnosis performance for the shockable/non-shockable rhythms. The validated classification performance with Ac of 99.52%, Se of 97.69%, and Sp of 99.91% on the validation set and an average diagnosis time of 3.97 s imply the effectiveness and simplicity of our proposed SAA, which is less complex than other designs using signal transformation techniques for the feature extraction.