Real-time machine learning model to predict in-hospital cardiac arrest using heart rate variability in ICU

Lee, Hyeonhoon; Yang, Hyun-Lim; Ryu, Ho Geol; Jung, Chul-Woo; Cho, Youn Joung; Yoon, Soo Bin; Yoon, Hyun-Kyu; Lee, Hyung-Chul

doi:10.1038/s41746-023-00960-2

Download PDF

Article
Open access
Published: 23 November 2023

Real-time machine learning model to predict in-hospital cardiac arrest using heart rate variability in ICU

npj Digital Medicine volume 6, Article number: 215 (2023) Cite this article

5588 Accesses
5 Citations
26 Altmetric
Metrics details

Subjects

Abstract

Predicting in-hospital cardiac arrest in patients admitted to an intensive care unit (ICU) allows prompt interventions to improve patient outcomes. We developed and validated a machine learning-based real-time model for in-hospital cardiac arrest predictions using electrocardiogram (ECG)-based heart rate variability (HRV) measures. The HRV measures, including time/frequency domains and nonlinear measures, were calculated from 5 min epochs of ECG signals from ICU patients. A light gradient boosting machine (LGBM) algorithm was used to develop the proposed model for predicting in-hospital cardiac arrest within 0.5–24 h. The LGBM model using 33 HRV measures achieved an area under the receiver operating characteristic curve of 0.881 (95% CI: 0.875–0.887) and an area under the precision-recall curve of 0.104 (95% CI: 0.093–0.116). The most important feature was the baseline width of the triangular interpolation of the RR interval histogram. As our model uses only ECG data, it can be easily applied in clinical practice.

Development of a machine learning-based clinical decision support system to predict clinical deterioration in patients visiting the emergency department

Article Open access 26 May 2023

Machine learning algorithms for predicting days of high incidence for out-of-hospital cardiac arrest

Article Open access 19 June 2023

Early heart rate variability evaluation enables to predict ICU patients’ outcome

Article Open access 15 February 2022

Introduction

In-hospital cardiac arrest is a sudden and unexpected complication in patients admitted to intensive care units (ICUs). Despite advancements in critical care medicine, the incidence of cardiac arrest in ICU patients remains high, with reported rates ranging 0.5–7.8% upon hospital admission¹. Early identification and rapid treatment are key to improving patient outcomes, but limited ICU resources and diverse causes of in-hospital cardiac arrest pose difficulties in preventing this life-threatening event². Thus, developing a continual and accurate prediction model for in-hospital cardiac arrest in ICU settings is critical for enabling real-time detection and prompt interventions, including early defibrillation and cardiopulmonary resuscitation (CPR), to improve patient outcomes.

Numerous prediction models rely on electronic medical records (EMR) to extract various clinical features for predicting cardiac arrest. Generally, such models have good discrimination performance. A retrospective cohort study of patients with acute coronary syndrome collected 20 clinical variables such as vital signs, laboratory results, and electrocardiogram (ECG) reports within 24 h before cardiac arrest to develop prediction models, and the best model achieved a better discrimination performance than existing risk prediction scores, such as the National and Modified Early Warning scores³. Another study utilized EMR to collect nine clinical variables, including chief complaints and demographic data, to develop a prediction model for cardiac arrest in emergency departments⁴. Since such prediction models are often limited by the need to collect multiple variables from EMR, some of the variables may not be immediately available or reliable, while others may be completely unavailable in certain hospitals⁵. Contrarily, ECG, widely used for continuous monitoring of critically ill patients, accelerated by recent machine learning (ML) algorithms is capable of detecting various cardiac abnormalities automatically⁶. Therefore, an ECG-based prediction model can simplify the process and ensure constant, real-time monitoring for early and rapid prediction of in-hospital cardiac arrest in real-world clinical settings.

Several ECG-based markers such as heart rate, QRS prolongation, early repolarization, and heart rate variability (HRV) have been associated with in-hospital cardiac arrest, leading to the development of cardiac arrest prediction models that use these markers as predictors^7,8,9. Among the aforementioned parameters, HRV, which is a measure of time variance between successive heartbeats (also known as RR intervals), has been identified as a promising predictor of cardiac arrest owing to its ability to evaluate the effects of autonomic nervous system activity in the heart^10,11,12,13. Several HRV measures including the standard deviation of normal RR intervals (SDNN) and low-frequency (LF) and high-frequency (HF) powers have been reported as significant predictors of cardiac arrest^11,14. Furthermore, a recent multi-center prospective cohort study suggested that HRV triangular index (HTI), calculated as the total number of RR intervals divided by their histogram height, can be an independent predictor of cardiac arrest¹⁵. However, these studies are limited by their focus on single HRV measures, overlooking the diverse information potentially offered by multiple HRV measures. Since all HRV measures originate from a single ECG source, the nature of HRV measures renders them cumbersome in conventional statistical models, including multivariable logistic regression.

The use of ML for developing prediction models has recently gained attention, as the models can learn complex relationships among several variables without requiring prespecified assumptions such as independence and linearity^16,17. In a previous study¹⁸, a prediction model for heart failure was developed and validated using various 12-lead ECG features, including QT interval, QRS duration, R wave axis, T wave axis, and heart rate, along with demographic data, and the presence of atrial fibrillation (AF) and atrial flutter. This model achieved an impressive AUROC of 0.889. Lai et al. predicted sudden cardiac death by utilizing ECG-derived measurable arrhythmic risk, specifically three repolarization interval ratios and two conduction-repolarization markers¹⁹. However, the limited sample size (n = 46) restricts its broader implications, despite its exceptional accuracy of 99.49%. A recent review highlighted multiple ML-based prediction models for cardiac arrest using ECG²⁰, but only two out of the 10 models used sample sizes exceeding 1000. One of these models, developed by Kwon et al., achieved an impressive AUROC of 0.948 for predicting cardiac arrest within 24 h, based on 12-lead ECG recordings from a substantial dataset of 25,672 patients²¹. The other model, created by Do et al. for predicting ventricular tachycardia, with an AUROC of 0.829, required a 3 h epoch of ECG data²². However, 12-lead ECG recordings may not be feasible for critically ill patients needing real-time monitoring. Therefore, HRV measures, easily obtainable from a single-lead ECG, have garnered attention. In a prospective observational study of 925 patients admitted to an emergency department, a support vector machine utilizing HRV measures with other clinical variables was found to be more accurate than a Modified Early Warning score in predicting cardiac arrest within 72 h, achieving an AUROC of 0.781²³. Although these studies highlighted the potential of ML algorithms using HRV measures for predicting cardiac arrest, few studies have developed ML-based prediction models for in-hospital cardiac arrest using multiple HRV measures from only ECGs in large samples of critically ill patients.

In this study, we develop and validate an ML-based prediction model for in-hospital cardiac arrest in ICU patients using HRV. We collect ECG data from a large sample, single-center, retrospective cohort and extract various HRV measures. Thereafter, we utilize a modern ML algorithm to capture the complex relationship among these measures and improve the predictive performance. To the best of our knowledge, this study is the first to use ML models to predict in-hospital cardiac arrest in ICU patients, using multiple HRV measures as predictors, and to validate the model on a large sample of patients.

Results

Dataset construction

A total of 5771 patients (6982 ICU stays) were eligible, of which 4821 patients (5679 ICU stays) were analyzed for developing and validating the proposed prediction model (Fig. 1). Patient demographics are listed in Table 1. The incidence of sudden cardiac arrest was 1.88%. The ECG data were preprocessed for quality checks, which resulted in 634,396 (1.24% event rate) and 139,663 (1.35% event rate) epochs in the development and validation sets, respectively. After the analysis of 43 HRV measurements, 33 were selected using the BorutaShap algorithm to develop the prediction model (Supplementary Fig. 1).

**Fig. 1: Flowchart of the study cohort.**

Table 1 Demographic characteristics of the study population at the ICU level.

Full size table

Model evaluation results

Following hyperparameter optimization through fivefold cross-validation, we retrained our light gradient boosting machine (LGBM) model on the entire development set and subsequently evaluated its performance on the validation set. As a primary outcome, the model achieved an area under the receiver operating curve (AUROC) of 0.881 [95% confidence interval (CI): 0.875–0.887] and an area under the precision-recall curve (AUPRC) of 0.104 [95% CI: 0.093–0.116] (Fig. 2). The AUROC of the secondary outcomes were comparable to that of the primary outcome, while the AUPRC declined as the range of the prediction period narrowed and neared the event of sudden cardiac arrest. Additional metrics assessing the discriminative performance of the model, including sensitivity, specificity, precision, accuracy, and F1-score, are presented in Table 2. Considering the calibration performance, our model overpredicted in the 0.2–0.3 range of predicted probability for the primary outcome (Fig. 3). For the secondary outcomes, our model exhibited consistent and reliable calibration at both 18 and 12 h. However, as the prediction period narrowed (from 6 to 1 h), the model increasingly overpredicted sudden cardiac arrests. The results of subgroup analyses, stratified by the type of patient monitor, are detailed in Table 3. The AUROCs of our model did not show significant variation between the two types of patient monitors.

**Fig. 2: Receiver operating characteristic and precision-recall curves that represent the discrimination performance of the best model on the validation set.**

Table 2 Discrimination performance of sudden cardiac arrest.

Full size table

Table 3 Discrimination performance of sudden cardiac arrest stratified by patient monitor type.

Full size table

Comparative analysis

To provide context, a clinical parameter-based model was also developed for comparative purposes. All stages of developing the clinical parameter-based model, including feature selection, model development, and validation, mirrored those of our model. The BorutaShap algorithm selected 42 features, excluding only one (the difference feature of diastolic blood pressure). Our model showed a significantly higher AUROC than the clinical parameter-based model (0.881 vs. 0.735, p < 0.001). Additional metrics evaluating the discriminative performances of the model are presented in Supplementary Table 1.

Feature importance analysis

The feature importance of our model was analyzed using the Shapley additive explanations (SHAP) method (Fig. 4). The most important feature, as determined by the SHAP values, was the baseline width of the triangular interpolation of the RR interval histogram (TINN), followed by HTI, the inverse of the average length of the acceleration/deceleration segments (IALS), 20th percentile of the RR intervals (Prc20NN), the minimum of the RR intervals (MinNN), and the interquartile range of the RR intervals (IQRNN). Among these features, a higher IALS value, as well as lower values of TINN, HTI, Prc20NN, and MinNN, were associated with a high risk of in-hospital cardiac arrest.

**Fig. 4: Shapley additive explanation dependence determines the relationship between the value of a feature and the predicted outcome of the model.**

Change of HRV measures over time until the event

An additional analysis was conducted to reveal the changes in HRV measures before an in-hospital cardiac arrest, specifically within the timeframe of 0.5 h to 24 h preceding the cardiac arrest event. The results highlighted the top six important features in our model (Fig. 5). The HTI values increased until the event of cardiac arrest, whereas IALS and MinNN values decreased. TINN, Prc20NN, and IQRNN values started to increase at ~6 h before the event of cardiac arrest (Supplementary Fig. 2).

**Fig. 5: Changes in key HRV measures over time until the event.**

Discussion

Recognizing the need for predicting in-hospital cardiac arrest in critically ill patients, we developed and validated an ML-based prediction model for in-hospital cardiac arrest using HRV measures in ICU patients. Our model leveraged HRV measures to overcome limitations encountered with conventional prediction models that rely on extensive EMR data. The proposed model not only simplifies the prediction process through a single data source but also facilitates real-time, continuous monitoring. The results demonstrated the potential of the LGBM model, which achieved good discrimination performance. This was paramount for the early detection and rapid prediction of in-hospital cardiac arrest, thereby improving patient outcomes in real-world clinical settings. This study highlights the (1) ability of the proposed model to predict the risk of in-hospital cardiac arrest using ECG data only, (2) usability of multiple HRV measures in the proposed ML-based model, and (3) explainability of the model through HRV measures.

In this study, only ECG data are used to predict the risk of in-hospital cardiac arrest, making our proposed model highly accessible and transferable to other healthcare settings that collect ECG data because continuous ECG monitoring is a standard practice in ICU settings. Unlike previous studies that employed multiple data sources, such as demographic information, vital signs, and laboratory results, to develop their prediction models^4,23,24,25, our model finds easy application in clinical practice because only ECG data are required to predict cardiac arrest in ICU settings. Additionally, we conducted a comparative analysis between our model and a clinical parameter-based model from a previous study, which utilized 43 features derived from six vital signs. While the clinical parameter-based model achieved an impressive AUROC of 0.94 for predicting in-hospital cardiac arrest within 1 h, our findings indicated that its performance was not consistently maintained when predicting events occurring within 24 h (Supplementary Table 1).

Since HRV quantifies dynamic changes in ECG signals, previous studies utilized HRV measures to develop models in various medical contexts, including the prediction of poor outcomes or treatment responses^26,27,28. However, such studies used traditional statistical models, such as a multivariable logistic regression model, which limited the number of HRV measures that could have been used owing to the linearity assumption between predictors and outcomes²⁹. Contrarily, ML-based models handle complex relationships among predictors and outcomes, thus offering the advantage of using numerous other HRV measures, including IALS, pNN50, TINN, and HTI, in the model development process, in addition to the traditional sets of HRV measures, such as the mean of the RR intervals (meanNN), SDNN, LF, and HF²⁹. Furthermore, ML-based models provide a distinct advantage while managing the inherently nonlinear and nonstationary fluctuations of HRV measures²⁹. In our study, we utilized nonlinear measures including IALS, TINN, and HTI, which have been proven effective in detecting diseases such as end-stage renal disease, primary aldosteronism, and pulmonary hypertension^30,31,32. The integration of these nonlinear HRV measures into ML algorithms proved to be of great potential in delivering superior discriminative performance. This observation was consistent with those of previous studies on different diseases³³, thereby further endorsing the effectiveness of the proposed approach.

In this study, the BorutaShap algorithm was employed to identify the most relevant HRV measure from 43 HRV measures, resulting in the selection of 33 HRV measures as input features for the model. Utilizing such a comprehensive set of HRV measures increased the accuracy and robustness of the prediction model. The feature importance analysis results determined using the SHAP method revealed that TINN, HTI, IALS, Prc20NN, MinNN, and IQRNN were the most critical HRV measures in the in-hospital cardiac arrest prediction model.

TINN, standing as the most pivotal feature in our study, was closely followed by HTI. Both TINN and HTI are time-domain HRV measures derived from geometric analysis, providing insights into the overall shape and distribution of the RR interval histogram¹⁰. TINN quantifies the baseline width of the distribution of RR intervals using triangular interpolation, where the triangle is determined by the least squares error. A larger TINN value typically signifies greater variability in the RR intervals. Conversely, HTI reflects the total number of RR intervals divided by the height of these intervals, shedding light on how the RR intervals are distributed. A lower HTI suggests that a higher proportion of intervals cluster around the mode, while a higher HTI indicates a wider spread of intervals. Notably, previous research has emphasized the importance of both HTI and TINN in cardiac risk assessment. Studies have shown that these values tend to be significantly lower in patients with sudden cardiac death compared to those with hypertrophic cardiomyopathy or healthy individuals³⁴. Additionally, in the context of developing prediction models for cardiac arrest in critically ill patients, TINN and HTI values have been found to be lower in patients experiencing cardiac arrest compared to those without²³. These values have also exhibited distinctions in patients with arrhythmias compared to healthy individuals, with a notable difference in HTI values between these groups³⁵. Furthermore, a previous study proved HTI to be an independent predictor of cardiovascular mortality in patients with AF¹⁵.

New HRV measures introduced in recent studies were applied to this study. A new HRV measure known as heart rate fragmentation or IALS was identified as one of the important features of our study. For IALS, acceleration, and deceleration segments were defined by a sequence of RR intervals between consecutive inflection points, for which the difference between the two RR intervals was <0 and >0, respectively. Segment length was determined as the number of RR intervals in that segment³⁶. A prior study revealed that IALS was significantly higher in patients with congestive heart failure (CHF), with a mean IALS of 0.78. This result is similar to that of our study (Fig. 5), suggesting that higher IALS can be associated with compromised cardiac conditions³⁷. Approximately 30–50% of the patients with CHF were estimated to be at risk of sudden cardiac arrest³⁸.

Few studies have used the other HRV measures included in our study, such as IQRNN, to study the relationship between those measures and cardiac arrest; however, our findings suggested that IQRNN has the potential as predictors of cardiac arrest. The values of IQRN, as well as TINN in patients experiencing sudden cardiac arrest, remained similar to those in patients without sudden cardiac arrest up until approximately 6 h prior to the event, after which dynamic changes occurred. Nevertheless, the causality between these HRV measures and cardiac arrest requires further investigation.

Changes in HRV measures were analyzed within the timeframe of 0.5 h to 24 h preceding the in-hospital cardiac arrest and compared with their median values in patients without in-hospital cardiac arrest, as shown in Fig. 5. The IALS values were consistently higher within 0.5 to 24 h preceding the cardiac arrest event compared to the patients without in-hospital cardiac arrest; however, there was a decreasing trend in these values leading up to the cardiac arrest event. Conversely, HTI values started low, but increased towards the event of cardiac arrest. These consecutive changes in HRV measures have not been documented in previous studies. Therefore, the analytical results of this study are expected to provide valuable insights into the real-time condition evaluation of a patient and facilitate the prompt initiation of interventions aimed at preventing events of cardiac arrest.

Furthermore, this study has the significant advantage of utilizing a large sample size of ~5000 patients, which adds to the representativeness and generalizability of the results to other patient populations. A large sample size is critical for accurately detecting rare events, such as in-hospital cardiac arrest, which is essential for developing reliable ML-based predictive models^39,40.

Nevertheless, the limitations of this study must be considered while interpreting the results. The binary classification model used has certain restrictions; the model can only predict whether a patient will experience a cardiac arrest but does not provide information on the timing of the event; however, we tried to evaluate our model on different time periods as secondary outcomes. Additionally, the model does not account for the influence of treatment interventions on outcomes and focuses solely on baseline predictors. The selection of the development and validation sets may also have been biased, which can affect the accuracy and generalizability of the results. Furthermore, the study was performed at a single center, limiting the transferability of the findings to other patient populations and healthcare systems.

Future research should focus on validating the findings of this study in larger multi-center studies to increase the generalizability of the results and confidence in the predictions made by the model. Open datasets with labels for cardiac arrest and ECG waveforms, such as the Medical Information Market for Intensive Care, can help validate our results before conducting a multi-center prospective study. Moreover, incorporating clinical factors such as comorbidities or medications may further assist the model⁴¹; however, we intentionally excluded these factors in this study considering variable availability across different hospital settings. Additionally, developing survival models that account for both the probability and timing of a cardiac arrest event is expected to provide valuable information for clinical decision-making and allow a better understanding of the long-term outcomes of patients who experience sudden cardiac arrest in ICU settings.

In conclusion, we developed and validated an ML-based real-time prediction model to predict in-hospital cardiac arrest in critically ill patients, focusing on the importance of HRV measures. If future prospective studies validate our results, they can potentially be used to detect in-hospital cardiac arrest in critically ill patients.

Methods

Study design

All data for model development were retrieved from a prospective registry containing vital signs of ICU patients at the Seoul National University Hospital (SNUH). The prospective registry was approved by the Institutional Review Board (IRB) of SNUH (approval number: 1408-101-605) and registered at ClinicalTrials.gov (NCT02914444). Furthermore, the IRB approved the retrospective analysis of data from the prospective registry (approval number: 2303-113-1413). Due to the retrospective nature of this study and the anonymity of data, the IRB waived off the requirement for written informed consent from patients.

Data collection

For this study, registry data of all the patients admitted to medical or surgical ICUs (MICU or SICU) at SNUH from March 2020 to August 2022 were eligible. However, patients under the age of 18 and those without ECG recordings were excluded from the study. The ECG data used in this study were collected using two different patient monitors (IntelliVue, Philips Healthcare, Amsterdam, Netherlands, and SolarTM 8000 M, GE Healthcare, Wauwatosa, WI, USA) and stored in a free biosignal collection program (version 1.9.9, accessed on June 6, 2022, https://vitaldb.net)¹⁷. The clinical parameters, used for comparative analysis, including heart rate, systolic blood pressure, diastolic blood pressure, mean blood pressure, SpO₂, and respiration rate, were also collected using the same patient monitors. The event time of CPR was extracted from the clinical data warehouse of SNUH (Supreme 1.0, Seoul National University Hospital, Seoul, Republic of Korea) to incorporate the presence and time of sudden cardiac arrests during each ICU stay. For ICU stays with multiple CPR attempts, only the first CPR event was used.

We constructed a structured ECG dataset of ICU stays with (event group) and without (control group) sudden cardiac arrests (Fig. 6). For the event group, ECG data from 0.5 to 24 h prior to the event were collected and 5 min epochs with 5 min intervals spanning 0.5–24 h were extracted. Each epoch begins immediately after the end of the previous one. For the control group, we randomly sampled ECG data for 24 h from each ICU stay and extracted 5 min epochs with 5 min intervals similar to the event group. Only data from the 5-min epochs were used in calculating the HRV features and predicting the event. Thereafter, the dataset was randomly divided into development (80%) and validation (20%) sets at the patient level, while maintaining the same ratio of groups in both sets.

**Fig. 6: Collection protocol of 5 min epoch within ECG data.**

Data preprocessing

The ECG signals were originally collected at a sampling rate of 500 Hz but were downsampled to 250 Hz to reduce the amount of data and computational resources required for processing. To ensure a more accurate HRV analysis, a series of preprocessing steps were applied to the ECG data: ECG signals were divided into 5 min epochs, signals were filtered to remove noise, and data quality was checked to ensure that the data were usable. Specifically, a 0.5 Hz high-pass Butterworth filter (order = 5) was used, followed by powerline filtering of the 5 min ECG signal during the first step. Next, a continuous quality index was computed by interpolating the distance of each QRS segment from the average QRS segment (1 corresponded to the heartbeats closest to the average sample, while 0 corresponded to the most distant heartbeat from the average sample). When the proportion of QRS segments with a quality index greater than or equal to 0.9 was not greater than 80%, a few 5 min epochs were excluded according to the quality index. For the clinical parameter-based model, which was developed based on a prior study²⁵, we initially established a total of 43 features. This set included six raw features, six differential features, 30 statistical features computed using a sliding window approach, and one additional feature known as the shock index. Because the clinical parameters were originally collected at 2 sec intervals, we extracted median values within each 5 min epoch for the raw features. The differential features were calculated as the differences between the values in the current epoch and those in the previous epoch for the clinical parameters. We applied a fixed-length 2-h sliding window to segment each parameter with a 5 min interval. These segments were then aggregated to statistical measures such as the mean, median, minimum, maximum, and standard deviation for each feature across all the segments.

Calculation of HRV parameters

The Neurokit2 Python library, a comprehensive and validated toolkit for ECG signal analysis, was utilized to detect R peaks and extract various HRV measures from each 5 min epoch, as employed in previous studies^42,43,44. The toolkit facilitated the calculation of HRV measures based on detected R peaks, ensuring reliable results through a standardized and automated approach. The HRV measures were calculated using the detected R-peak information. The toolkit provided a total of 74 HRV measures comprising 24 time-domain measures such as meanNN, SDNN, and square root of the mean of the squared successive differences between adjacent RR intervals (RMSSD); 9 frequency-domain measures such as the spectral power of LF, HF, and the ratio of LF to HF (LF/HF); and 41 nonlinear measures such as the standard deviation perpendicular to the line of identity (SD1), cardiac sympathetic index, and cardiac vagal index. In the nonlinear category, we preselected 15 measures, which were derived either from the Poincaré plot^45,46, or the heart rate fragmentation approach³⁶. Additionally, we excluded four time-domain HRV measures as they required ECG epochs longer than 5 min. Consequently, we began with a total of 43 HRV features as the initial feature candidates before applying the BorutaShap algorithm (Supplementary Fig. 1).

Measurement outcome

The primary outcome was the occurrence of cardiac arrest within 0.5–24 h, as in a previous study^3,4. The secondary outcomes included the occurrences of cardiac arrest from 0.5 to 18, 12, 6, 3, and 1 h. The discrimination performances of the model were evaluated using AUROC, AUPRC, sensitivity, specificity, precision, accuracy, and F1-score. To evaluate the calibration performance of the model, we graphed a calibration plot comparing the predicted probabilities of sudden cardiac arrest against the observed fractions.

Feature selection

For feature selection, we employed the BorutaShap algorithm because it uses a combination of the Boruta and SHAP algorithms to identify the most important features in a given dataset⁴⁷. Thus, the BorutaShap algorithm was applied to identify the most relevant features from the 43 predetermined HRV measures extracted for our model, and 43 clinical features extracted for the clinical parameter-based model, respectively. In this study, only the features categorized as “accepted” by the BorutaShap algorithm were chosen for inclusion in the model development process, with those marked as “tentative” or “rejected” being excluded.

Model development and validation

A LGBM model was utilized to develop the proposed ML-based prediction model, which is an implementation of the decision tree-based ensemble algorithm, with high efficiency, scalability, and strong performance on a wide range of datasets⁴⁸. The hyperparameters of the LGBM model were optimized using Bayesian optimization, which is an efficient approach to automatically tune ML algorithms by modeling the generalization performance of a learning algorithm as a sample from a Gaussian process and using the tractable posterior distribution to select the next optimal parameter for trial⁴⁹. Hyperparameter optimization was conducted with fivefold cross-validation at the patient level using the development set to find the best hyperparameters such as the number of leaves, fraction of features, and regularization determined using the AURPC score. Subsequently, we performed another round of fivefold cross-validation at the patient level to identify the optimal model training parameters including the number of boosting iterations. Once these optimal parameters were established, we proceeded to train the model using the entire development set. Following this, we implemented the Beta calibration method for further refinement⁵⁰. Following this, we implemented the Beta calibration method for further refinement. The final model, resulting from these steps, was tested with the validation set. All this learning scheme was applied to both our model and the clinical parameter-based model.

Feature importance

We employed the SHAP method⁵¹ to explain the output of the ML model based on a game-theoretic framework. Each feature was assigned a unique contribution value that indicated its impact on the prediction outcome. In the classic concept of Shapley values from cooperative game theory, SHAP values are grounded and a way to distribute the prediction outcomes fairly among all features is provided. Additionally, this method is used to determine the attributes of each feature to the predicted outcome applied to the validation set.

Statistical analysis

Kendall’s tau coefficient was used to measure the association between the time for the event and HRV measures. DeLong’s test was employed to compare the AUROCs⁵². All statistics were reported with point estimates and 95% CIs. Python 3.8.0 (Python Software Foundation, Wilmington, DE, USA) was used for signal preprocessing, model development, validation, statistical testing, and visualization. A p value < 0.05 was considered statistically significant.

Data availability

The dataset used in this study is not publicly available. However, the data of this study can be provided if there is a reasonable request to the corresponding author.

Code availability

The code to generate the result of this study can be accessed at https://github.com/HyeonhoonLee/hrvarrest.

References

Armstrong, R. A. et al. The incidence of cardiac arrest in the intensive care unit: a systematic review and meta-analysis. J. Intensive Care Soc. 20, 144–154 (2019).
Article PubMed Google Scholar
Penketh, J. & Nolan, J. P. In-hospital cardiac arrest: the state of the art. Crit. Care 26, 376 (2022).
Article PubMed PubMed Central Google Scholar
Wu, T. T., Lin, X. Q., Mu, Y., Li, H. & Guo, Y. S. Machine learning for early prediction of in-hospital cardiac arrest in patients with acute coronary syndromes. Clin. Cardiol. 44, 349–356 (2021).
Article PubMed PubMed Central Google Scholar
Hong, S., Lee, S., Lee, J., Cha, W. C. & Kim, K. Prediction of cardiac arrest in the emergency department based on machine learning and sequential characteristics: model development and retrospective clinical validation study. JMIR Med. Inform. 8, e15932 (2020).
Article PubMed PubMed Central Google Scholar
Song, X. et al. Cross-site transportability of an explainable artificial intelligence model for acute kidney injury prediction. Nat. Commun. 11, 5668 (2020).
Article CAS PubMed PubMed Central Google Scholar
Zhu, H. et al. Automatic multilabel electrocardiogram diagnosis of heart rhythm or conduction abnormalities with deep learning: a cohort study. Lancet Digit. Health 2, e348–e357 (2020).
Article PubMed Google Scholar
Haissaguerre, M. et al. Sudden cardiac arrest associated with early repolarization. N. Engl. J. Med. 358, 2016–2023 (2008).
Article CAS PubMed Google Scholar
Attin, M. et al. Electrocardiogram characteristics prior to in-hospital cardiac arrest. J. Clin. Monit. Comput. 29, 385–392 (2015).
Article PubMed Google Scholar
Thoren, A. et al. ECG-monitoring of in-hospital cardiac arrest and factors associated with survival. Resuscitation 150, 130–138 (2020).
Article PubMed Google Scholar
Task Force of the European Society of Cardiology and The North American Society of Pacing and Electrophysiology. Heart rate variability: standards of measurement, physiological interpretation and clinical use. Circulation 93, 1043–1065 (1996).
Lombardi, F. & Mortara, A. Heart rate variability and cardiac failure. Heart 80, 213–214 (1998).
Article CAS PubMed PubMed Central Google Scholar
Bodenes, L. et al. Early heart rate variability evaluation enables to predict ICU patients’ outcome. Sci. Rep. 12, 2498 (2022).
Article CAS PubMed PubMed Central Google Scholar
Natarajan, A., Pantelopoulos, A., Emir-Farinas, H. & Natarajan, P. Heart rate variability with photoplethysmography in 8 million individuals: a cross-sectional study. Lancet Digit. Health 2, e650–e657 (2020).
Article PubMed Google Scholar
La Rovere, M. T. et al. Short-term heart rate variability strongly predicts sudden cardiac death in chronic heart failure patients. Circulation 107, 565–570 (2003).
Article PubMed Google Scholar
Hammerle, P. et al. Heart rate variability triangular index as a predictor of cardiovascular mortality in patients with atrial fibrillation. J. Am. Heart Assoc. 9, e016075 (2020).
Article PubMed PubMed Central Google Scholar
Bzdok, D., Altman, N. & Krzywinski, M. Statistics versus machine learning. Nat. Methods 15, 233–234 (2018).
Article CAS PubMed PubMed Central Google Scholar
Moffat, L. M. & Xu, D. Accuracy of machine learning models to predict in-hospital cardiac arrest: a systematic review. Clin. Nurse Spec. 36, 29–44 (2022).
Article PubMed Google Scholar
Kwon, J. M. et al. Development and validation of deep-learning algorithm for electrocardiography-based heart failure identification. Korean Circ. J. 49, 629–639 (2019).
Article PubMed PubMed Central Google Scholar
Lai, D., Zhang, Y., Zhang, X., Su, Y. & Heyata, M. B. B. An automated strategy for early risk identification of sudden cardiac death by using machine learning approach on measurable arrhythmic risk markers. IEEE Access 7, 94701–94716 (2019).
Article Google Scholar
Kolk, M. Z. H. et al. Machine learning of electrophysiological signals for the prediction of ventricular arrhythmias: systematic review and examination of heterogeneity between studies. EBioMedicine 89, 104462 (2023).
Article PubMed PubMed Central Google Scholar
Kwon, J. M. et al. Artificial intelligence algorithm for predicting cardiac arrest using electrocardiography. Scand. J. Trauma Resusc. Emerg. Med. 28, 98 (2020).
Article PubMed PubMed Central Google Scholar
Do, D. H. et al. Usefulness of trends in continuous electrocardiographic telemetry monitoring to predict in-hospital cardiac arrest. Am. J. Cardiol. 124, 1149–1158 (2019).
Article PubMed PubMed Central Google Scholar
Ong, M. E. et al. Prediction of cardiac arrest in critically ill patients presenting to the emergency department using a machine learning score incorporating heart rate variability compared with the modified early warning score. Crit. Care 16, R108 (2012).
Article PubMed PubMed Central Google Scholar
Kwon, J. M., Lee, Y., Lee, Y., Lee, S. & Park, J. An algorithm based on deep learning for predicting in-hospital cardiac arrest. J. Am. Heart Assoc. 7, e008678 (2018).
Yijing, L. et al. Prediction of cardiac arrest in critically ill patients based on bedside vital signs monitoring. Comput. Methods Prog. Biomed. 214, 106568 (2022).
Article Google Scholar
Choi, K. W. & Jeon, H. J. Heart rate variability for the prediction of treatment response in major depressive disorder. Front. Psychiatry 11, 607 (2020).
Article PubMed PubMed Central Google Scholar
Arbo, J. E. et al. Heart rate variability measures for prediction of severity of illness and poor outcome in ED patients with sepsis. Am. J. Emerg. Med. 38, 2607–2613 (2020).
Article PubMed PubMed Central Google Scholar
Jarczok, M. N. et al. Heart rate variability in the prediction of mortality: a systematic review and meta-analysis of healthy and patient populations. Neurosci. Biobehav. Rev. 143, 104907 (2022).
Article PubMed Google Scholar
Shaffer, F. & Ginsberg, J. P. An overview of heart rate variability metrics and norms. Front. Public Health 5, 258 (2017).
Article PubMed PubMed Central Google Scholar
Lin, Y. H. et al. Heart rhythm complexity impairment in patients undergoing peritoneal dialysis. Sci. Rep. 6, 28202 (2016).
Article CAS PubMed PubMed Central Google Scholar
Lin, Y. H. et al. Reversible heart rhythm complexity impairment in patients with primary aldosteronism. Sci. Rep. 5, 11249 (2015).
Article CAS PubMed PubMed Central Google Scholar
Tsai, C. H. et al. Heart rhythm complexity impairment in patients with pulmonary hypertension. Sci. Rep. 9, 10710 (2019).
Article PubMed PubMed Central Google Scholar
Odenstedt Herges, H. et al. Machine learning analysis of heart rate variability to detect delayed cerebral ischemia in subarachnoid hemorrhage. Acta Neurol. Scand. 145, 151–159 (2022).
Article PubMed Google Scholar
Yan, S. P. et al. Performance of heart rate adjusted heart rate variability for risk stratification of sudden cardiac death. BMC Cardiovasc. Disord. 23, 144 (2023).
Article PubMed PubMed Central Google Scholar
Georgieva-Tsaneva, G. & Gospodinova, E. Heart rate variability analysis of healthy individuals and patients with ischemia and arrhythmia. Diagnostics (Basel) 13, 2549 (2023).
Article PubMed Google Scholar
Costa, M. D., Davis, R. B. & Goldberger, A. L. Heart rate fragmentation: a new approach to the analysis of cardiac interbeat interval dynamics. Front. Physiol. 8, 255 (2017).
Article PubMed PubMed Central Google Scholar
Wang, Y. The analysis of heart rate fragmentation for congestive heart failure. J. Phys.: Conf. Ser. 1213, 022027 (2019).
Google Scholar
Reinier, K. et al. The association between atrial fibrillation and sudden cardiac death: the relevance of heart failure. JACC Heart Fail. 2, 221–227 (2014).
Article PubMed Google Scholar
Biau, D. J., Kerneis, S. & Porcher, R. Statistics in brief: the importance of sample size in the planning and interpretation of medical research. Clin. Orthop. Relat. Res. 466, 2282–2288 (2008).
Article PubMed PubMed Central Google Scholar
Decherchi, S., Pedrini, E., Mordenti, M., Cavalli, A. & Sangiorgi, L. Opportunities and challenges for machine learning in rare diseases. Front. Med. (Lausanne) 8, 747612 (2021).
Article PubMed Google Scholar
Banerjee, A. et al. Identifying subtypes of heart failure from three electronic health record sources with machine learning: an external, prognostic, and genetic validation study. Lancet Digit. Health 5, e370–e379 (2023).
Article CAS PubMed Google Scholar
Makowski, D. et al. NeuroKit2: a Python toolbox for neurophysiological signal processing. Behav. Res. Methods 53, 1689–1696 (2021).
Article PubMed Google Scholar
Frasch, M. G. & Comprehensive, H. R. V. estimation pipeline in Python using Neurokit2: application to sleep physiology. MethodsX 9, 101782 (2022).
Article PubMed PubMed Central Google Scholar
Pham, T., Lau, Z. J., Chen, S. H. A. & Makowski, D. Heart rate variability in psychology: a review of HRV indices and an analysis tutorial. Sensors (Basel) 21, 3998 (2021).
Toichi, M., Sugiura, T., Murai, T. & Sengoku, A. A new method of assessing cardiac autonomic function and its comparison with spectral analysis and coefficient of variation of R-R interval. J. Auton. Nerv. Syst. 62, 79–84 (1997).
Article CAS PubMed Google Scholar
Yan, C. et al. Area asymmetry of heart rate variability signal. Biomed. Eng. Online 16, 112 (2017).
Article PubMed PubMed Central Google Scholar
Keany, E. BorutaShap: a wrapper feature selection method which combines the Boruta feature selection algorithm with Shapley value. https://zenodo.org/record/4247618 (2020).
Ke, G. et al. LightGBM: a highly efficient gradient boosting decision tree. Adv. Neural Inf. Process. Syst. 31, 3146–3154 (2017).
Google Scholar
Snoek, J. L., H. & Adams, R. P. Practical Bayesian optimization of machine learning algorithms. In: Adv. Neural Inf. Process. Syst. 25, 2951–2959 (2012).
Kull, M. S. F., T. M. Flach, P. Beta calibration: a well-founded and easily implemented improvement on logistic calibration for binary classifiers. In: Proceedings of the 20th International Conference on Artificial Intelligence and Statistics (AISTATS) 54, 623–631 (2017).
Lundberg, S., & Lee, S.-I. A unified approach to interpreting model predictions. Adv. Neural Inf. Process. Syst. 30, 4765–4774 (2017).
Google Scholar
DeLong, E. R., DeLong, D. M. & Clarke-Pearson, D. L. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics 44, 837–845 (1988).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

This work was funded by the Korea Health Technology R&D Project through the KHIDI, funded by the Ministry of Health & Welfare, Republic of Korea (grant number: HI21C1074). This work was also supported by the National Research Foundation of Korea (NRF) grant funded by the Korean government (MSIT) (grant number: RS-2023-00211674).

Author information

Authors and Affiliations

Department of Anesthesiology and Pain Medicine, Seoul National University Hospital, Seoul, Republic of Korea
Hyeonhoon Lee & Hyun-Lim Yang
Department of Data Science Research, Innovative Medical Technology Research Institute, Seoul National University Hospital, Seoul, Republic of Korea
Hyeonhoon Lee & Hyung-Chul Lee
Department of Medical Device Development Support, Innovative Medical Technology Research Institute, Seoul National University Hospital, Seoul, Republic of Korea
Hyun-Lim Yang
Department of Anesthesiology and Pain Medicine, Seoul National University College of Medicine, Seoul National University Hospital, Seoul, Republic of Korea
Ho Geol Ryu, Chul-Woo Jung, Youn Joung Cho, Soo Bin Yoon, Hyun-Kyu Yoon & Hyung-Chul Lee
Department of Critical Care Medicine, Seoul National University Hospital, Seoul, Republic of Korea
Ho Geol Ryu

Authors

Hyeonhoon Lee
View author publications
You can also search for this author in PubMed Google Scholar
Hyun-Lim Yang
View author publications
You can also search for this author in PubMed Google Scholar
Ho Geol Ryu
View author publications
You can also search for this author in PubMed Google Scholar
Chul-Woo Jung
View author publications
You can also search for this author in PubMed Google Scholar
Youn Joung Cho
View author publications
You can also search for this author in PubMed Google Scholar
Soo Bin Yoon
View author publications
You can also search for this author in PubMed Google Scholar
Hyun-Kyu Yoon
View author publications
You can also search for this author in PubMed Google Scholar
Hyung-Chul Lee
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.L. and H.C.L. contributed substantially to the study conception and design, data acquisition, and data analysis. H.G.R. and Y.J.C. collected data. H.G.R. and C.W.J. curated data. H.L. and H.L.Y. conducted data analysis and made tables and figures. C.W.J., S.B.Y., and H.K.Y. interpreted the results of data analysis. H.L. and H.C.L. participated in drafting the article, and S.B.Y. and H.K.Y. revised it critically for important intellectual content. All authors gave final approval of the version to be published.

Corresponding author

Correspondence to Hyung-Chul Lee.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Material

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Lee, H., Yang, HL., Ryu, H.G. et al. Real-time machine learning model to predict in-hospital cardiac arrest using heart rate variability in ICU. npj Digit. Med. 6, 215 (2023). https://doi.org/10.1038/s41746-023-00960-2

Download citation

Received: 20 June 2023
Accepted: 05 November 2023
Published: 23 November 2023
DOI: https://doi.org/10.1038/s41746-023-00960-2

This article is cited by

„Data-driven-Intensivmedizin“: Mangel an umfassenden Datensätzen
- Jan-Hendrik B. Hardenberg
Medizinische Klinik - Intensivmedizin und Notfallmedizin (2024)

Subjects

Abstract

Similar content being viewed by others

Development of a machine learning-based clinical decision support system to predict clinical deterioration in patients visiting the emergency department

Machine learning algorithms for predicting days of high incidence for out-of-hospital cardiac arrest

Early heart rate variability evaluation enables to predict ICU patients’ outcome

Introduction

Results

Dataset construction

Model evaluation results

Comparative analysis

Feature importance analysis

Change of HRV measures over time until the event

Discussion

Methods

Study design

Data collection

Data preprocessing

Calculation of HRV parameters

Measurement outcome

Feature selection

Model development and validation

Feature importance

Statistical analysis

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Supplementary Material

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

„Data-driven-Intensivmedizin“: Mangel an umfassenden Datensätzen

Search

Quick links