Critical Transitions in Intensive Care Units: A Sepsis Case Study

Ghalati, Pejman F.; Samal, Satya S.; Bhat, Jayesh S.; Deisz, Robert; Marx, Gernot; Schuppert, Andreas

doi:10.1038/s41598-019-49006-2

Download PDF

Article
Open access
Published: 09 September 2019

Critical Transitions in Intensive Care Units: A Sepsis Case Study

Pejman F. Ghalati¹^na1,
Satya S. Samal¹^na1^nAff3,
Jayesh S. Bhat¹,
Robert Deisz²,
Gernot Marx² &
…
Andreas Schuppert¹

Scientific Reports volume 9, Article number: 12888 (2019) Cite this article

5149 Accesses
12 Citations
3 Altmetric
Metrics details

Subjects

Abstract

The progression of complex human diseases is associated with critical transitions across dynamical regimes. These transitions often spawn early-warning signals and provide insights into the underlying disease-driving mechanisms. In this paper, we propose a computational method based on surprise loss (SL) to discover data-driven indicators of such transitions in a multivariate time series dataset of septic shock and non-sepsis patient cohorts (MIMIC-III database). The core idea of SL is to train a mathematical model on time series in an unsupervised fashion and to quantify the deterioration of the model’s forecast (out-of-sample) performance relative to its past (in-sample) performance. Considering the highest value of the moving average of SL as a critical transition, our retrospective analysis revealed that critical transitions occurred at a median of over 35 hours before the onset of septic shock, which suggests the applicability of our method as an early-warning indicator. Furthermore, we show that clinical variables at critical-transition regions are significantly different between septic shock and non-sepsis cohorts. Therefore, our paper contributes a critical-transition-based data-sampling strategy that can be utilized for further analysis, such as patient classification. Moreover, our method outperformed other indicators of critical transition in complex systems, such as temporal autocorrelation and variance.

AI-enabled electrocardiography alert intervention and all-cause mortality: a pragmatic randomized clinical trial

Article 29 April 2024

MetaboAnalystR 4.0: a unified LC-MS workflow for global metabolomics

Article Open access 01 May 2024

Principal component analysis

Article 22 December 2022

Introduction

Certain biological systems exhibit nonlinear dynamics that undergo sudden regime transitions at tipping points^1,2. In a medical context, these transitions often indicate changes in clinical phenotypes, e.g., disease-onset³. Such phenomena have been studied mathematically with techniques from the application of singularity theory to dynamical systems^4,5,6. In addition, data-driven methods use statistical indicators known as early-warning signals to model the dynamics of systems approaching transitions^{7,8,9,10,11,12,13,14}. Modeling such transitions is beneficial for several applications in systems medicine, such as monitoring health^15,16, predicting disease-onset and gaining an improved understanding of the underlying disease progression¹⁷.

Our focus is on sepsis, a common complication in the intensive care unit (ICU), and we introduce a notion of regime transition in septic dynamics. As stated in the Third International Consensus Definitions of Sepsis and Septic Shock (Sepsis-3), “sepsis is a life-threatening organ dysfunction caused by a dysregulated host response to infection”, and “septic shock is a subset of sepsis in which underlying circulatory and cellular/metabolic abnormalities are profound enough to substantially increase mortality¹⁸”. Sepsis causes a high rate of in-hospital mortality and costs the healthcare sector billions due to rising incidence rates and prolonged hospital stays^19,20. Accurate diagnosis, however, remains a challenging task for physicians due to the heterogeneity of infectious agents and the frequent presence of multiple comorbidities. Early, aggressive administration of antibiotics is crucial, and delays in this treatment significantly increase mortality^21,22.

To detect signs of sepsis early, numerous illness severity scores or early-warning signals exist: the Acute Physiology and Chronic Health Evaluation (APACHE II), the Simplified Acute Physiology Score (SAPS II), the Sepsis-related Organ Failure Assessment Score (SOFA), the Modified Early Warning Score (MEWS), and the Simple Clinical Score²³. These scores are good predictors of general disease severity and mortality but cannot estimate the risk of developing sepsis with reasonable sensitivity and specificity²³.

Numerous machine learning (ML) methods were therefore developed to predict sepsis onset^24,25,26. Rothman et al.²⁷ used structured information from electronic health records (EHRs) to identify sepsis on admission or to predict its onset during hospitalization. For septic shock prediction, Ghosh et al.²⁸ proposed an integrative model combining sequential contrast patterns with coupled hidden Markov models. Henry et al.²³ developed a targeted real-time early-warning score (TREWScore) by training a Cox regression model to identify patients at high risk of developing septic shock. Additionally, Horng et al.²⁹ argued that combining free-text patient data with other predictor features significantly improved the performance of ML models. Although these ML approaches have the potential to increase diagnostic accuracy, they involve time-consuming and domain-specific variable/feature selection^30,31. Our proposed method can be considered in the preprocessing stages to select appropriate data for further downstream analysis.

Our computational method aims to identify and characterize signals indicative of critical transitions based on the concept of surprise loss (SL)³². SL was originally developed in econometrics to assess forecast breakdown, i.e., instability in the model’s forecasting ability. Such instability was attributed to instability in the underlying data-generating process, whose effects have been studied from a mathematical perspective^33,34. We assume that similar instability occurs in patient data because of changes in the underlying biological mechanism due to medical intervention or disease progression.

We utilize SL to identify regions in the time series where the data-generating process changes and quantify them with a numerical score. The score captures the extent of deviation between the past performance of a model and its future performance. We consider the highest value of such a score to be a putative tipping point in the disease dynamics, and we consider it as a surrogate for critical transition. In addition, we present a critical-transition-based data-sampling strategy is also presented where data are sampled at regions around critical transition; this strategy outperforms random sampling in differentiation between septic shock and non-sepsis patients. We also compare our approach to methods based on autocorrelation and variance^7,15,16,35, which have been used to identify early-warning signals of critical transitions.

Materials and Methods

Data source

We sourced patients’ multivariate time series data from the publicly available EHR database, Medical Information Mart for Intensive Care MIMIC-III v1.4³⁶, which contained longitudinal data of 46,520 deidentified patients from 58,976 distinct ICU admissions. For ease of interpretation, we treated each admission as a distinct patient.

In the ICU, clinical staff make swift decisions or take prompt actions during patient management. These employees prioritize timely and correct treatment over consistent documentation of their processes, thereby limiting the reliability of clinical annotation for retrospective analysis. Furthermore, the execution of guidelines for identifying imminent disease varies across hospitals. Hence, we restricted our data analysis to predominantly machine-recorded quantitative variables.

Decision rules for retrospective annotation of the sepsis syndrome have evolved over the decades as knowledge of its pathophysiology and epidemiological impact have increased³⁷. Whereas earlier definitions (1991³⁸, 2001³⁹) focused on uncontrolled systemic inflammation as the major indicator, the latest 2016¹⁸ definition, commonly known as Sepsis-3, emphasizes organ dysfunction as the leading effect of the sepsis syndrome and proposes to update the International Classification of Diseases (ICD) coding system^40,41 (ICD-9: 995.92, 785.52; ICD-10: R65.20, R65.21). SOFA scoring system grades the extent of organ dysfunction and is calculated every 24 hours during a patient’s ICU stay^42,43.

Because the ICD-9 codes in our data were not compatible with Sepsis-3, we annotated the patient data in accordance with Table 2 from the 2016 consensus definition¹⁸. Fig. 1 illustrates a general schematic of our annotation framework.

The annotation framework was applied to all 58,976 patients, identifying 22,547 (38.2%) sepsis patients and 3208 (5.4%) septic shock patients. Among the 3208 septic shock patients, we analyzed only adults (18+ years old at admission) with at least a 36-hours stay and at most 144 hours spent in the unit before onset, which generated a cohort of 630 patients. Our non-sepsis cohort comprised 6,236 patients who lacked Sepsis-3 annotation or sepsis-specific ICD-9 codes and who stayed between 36 and 144 hours in the ICU. Demographic information on the two cohorts can be found in Supplementary Table S1.

We cannot exhaustively evaluate and validate the accuracy of our annotation framework owing to the absence of a manually curated “ground truth” dataset of Sepsis-3 patients. Software implementations with different data cleaning processes and patient exclusion criteria (PEC) from the same annotation framework could result in divergent cohorts. For example, for the same database, another implementation⁴⁴ annotated almost half (49.1%) of their analysis cohort (n = 11,791; reasonable PEC) as Sepsis-3, whereas our implementation annotated approximately 38% of the entire population (n = 58,976; no PEC). There may be a high degree of overlap in the annotated cohorts; thus, a comparison of the two implementations is currently under way.

Based on availability and relevance to sepsis, we preselected groups of variables: the laboratory variables included bicarbonate, creatinine, blood urea nitrogen (BUN), hematocrit, hemoglobin, platelet count, white blood cell count (WBC), potassium, and sodium; the vital signs and physiological variables comprised body temperature, heart rate, respiratory rate, oxygen saturation (SpO2), arterial blood pressure (systolic, mean, and diastolic), and urine output; the two septic markers comprised the shock index (ratio of heart rate over systolic blood pressure), and the ratio of BUN to creatinine²³. Table 1 shows the mean sampling rates of the variables in the respective patient cohorts, and their distribution can be seen in Supplementary Fig. S1.

Table 1 Mean sampling rate of the preselected clinical variables in the septic shock and non-sepsis cohorts.

Full size table

Missing value imputation and time binning

Data representation is a crucial step in analyzing time series. Continuous EHRs suffer from missing values due to insufficient data collection and lack of documentation. Additionally, high heterogeneity in variable type and irregular sampling intervals make such data difficult to handle. To address the problems of missing data and data sparsity, we transformed our time series into 30-minute time bins by imputing values in the bins and averaging measurement values over the bins. We experimented with different imputation methods, such as linear, polynomial and Stineman interpolation⁴⁵. The Stineman method was chosen due to its superior performance in reducing overshoots and handling sharp changes in the imputed values.

Data normalization

Our variables (Table 1) had different scales and measurement units. Data normalization was therefore needed for our method. For this purpose, we transformed the observables by Z-score normalization to address the use of different units of measurement.

State space model

To define SL, we require a dynamical mathematical model for our multivariate clinical time series. Here, we consider a state space model (SSM) approach^46,47, which models the data in a hierarchical manner with hidden states that give rise to observables. In our context, the hidden states can be assumed to represent the biological processes, and the observables represent the clinically measured variables. The observables in our SSM are expressed as linear combinations of hidden random states. Such a model incorporates the variations in the biological processes and a measurement noise term. The variations due to biology are modeled by adding a stochastic term to the hidden states, whereas the measurement noise term is added to the observables. Both terms are assumed to follow a multivariate normal (MVN) distribution.

The computation of SL is agnostic to the underlying dynamical model. The SL literature³² uses a linear dynamical model, whereas we use an SSM for our application. The primary reason to use this type of model is to separate the biological processes from the observables, i.e., to model two sources of variability. Below, we represent such an SSM model.

$$\begin{array}{llll}{x}_{t} & = & {x}_{t-1}+{w}_{t} & {\rm{where}}\,{w}_{t} \sim MVN\mathrm{(0,}\,Q),\,{x}_{0} \sim MVN(\pi ,\wedge )\\ {y}_{t} & = & Z{x}_{t}+a+{v}_{t} & {\rm{where}}\,{v}_{t} \sim MVN\mathrm{(0,}\,R)\end{array}$$

(1)

where the indices of the time series are from t = 1, …, T; e is the number of hidden trends; x is an e × T matrix of hidden states; y is an n × T matrix of n observables; and w is an e × T matrix of process error. In general, $e\ll n$. The process error at time t follows an MVN distribution with mean 0 and e × e covariance matrix Q; v is an n × T matrix of observation error. The observation error at time t follows an MVN distribution with mean 0 and n × n covariance matrix R; Z is an n × e parameter matrix; a is a vector of offsets; π is a matrix of e × 1 means; ∧ is an e × e covariance matrix. The set of parameters can be represented in compact form as θ = (Q, R, Z, x₁, …, _T, π, ∧), and their estimate is $\hat{\theta }$. ${\hat{y}}_{t}$ and ${\tilde{y}}_{t+\lambda }$ are the estimate and λ-step-ahead forecast, respectively, of the given observables y_t.

Our implementation incorporated MARSS^48,49, which is an R package for fitting constrained and unconstrained linear multivariate autoregressive SSMs by maximum likelihood parameter estimation. We utilized MARSS to fit an SSM to our multivariate time series data, using its recommended initial conditions that ensure parameter identifiability. We assumed the presence of multiple hidden states and fixed e = 3. Furthermore, we evaluated the robustness of our results with respect to the changes in the model parameters (see ‘Robustness of the SSM model’).

Perturbations in the dynamics

Early-warning indicator

Our proposed computational method based on surprise loss (SL)³² computes the difference between the forecast error, i.e., out-of-sample error, and the in-sample performance. The out-of-sample error measures the quality of model forecasts, i.e., the prediction of the model for the data that were not used for fitting, whereas the in-sample error quantifies the deviation between the model estimates and the data that were used for model fitting. A high out-of-sample error compared to the in-sample error is suggestive of instability in the patient data. In such a scheme, our model may be a poor fit for the data, but we are interested in evaluating whether the past performance of the model is consistent with future forecasts. The performance is measured for a fixed loss function using a moving time window. Furthermore, the SL computation is unsupervised, i.e., the clinical conditions of patients, such as septic shock or non-sepsis, are not required. Originally, the idea of SL was used to perform a statistical test to determine forecast breakdown in time series, i.e., to determine whether the average of SL is close to zero³². However, in our application, the aim is not to test whether a given time series underwent a statistically significant forecast breakdown; rather, it is to identify high SL values in the given time series and later use this information in postprocessing steps (see ‘Data-sampling strategy with SLMean’).

In spirit, this approach is close to the identification of structural breaks or change-points analysis^50,51. However, the SL-based approach has the additional advantage of being robust to model misspecification. Specifically, in practice, the SSM model (i.e., the functional form and variables) is likely to be misspecified and may not be a good approximation of the underlying disease processes. By formalizing SL as the difference between in-sample and out-of-sample performance and not relying on model parameters or error variances, the SL-based approach provides a natural way to handle such scenarios (see ‘Relationship with the literature’ in Giacomini et al.³²).

With a moving time window of width m, the SSM model (see equation (1)) was fitted for time indices $t-m+\mathrm{1,}\ldots ,t$. ${y}_{t}^{{i}_{c}}$ denotes the observables of a given patient i with clinical condition c at time index t, and ${T}^{{i}_{c}}$ is the length of the corresponding time series. The in-sample error is a quadratic loss function that averages the squared differences between the estimated and the given observables, and it is denoted as ${L}_{j}({\hat{\theta }}_{t}^{{i}_{c}})=\frac{1}{n}{\sum }_{k=1}^{n}\,{(y{(k)}_{j}^{{i}_{c}}-\hat{y}{(k)}_{j}^{{i}_{c}})}^{2}$ where $y{(k)}_{j}^{{i}_{c}}$ is the k^th element of column vector ${y}_{j}^{{i}_{c}}$. Similarly, the out-of-sample error is a quadratic loss function that averages the squared differences between the λ-step-ahead forecast and the given observables, and it is denoted as ${L}_{t+\lambda }({\hat{\theta }}_{t}^{{i}_{c}})=\frac{1}{n}{\sum }_{k=1}^{n}\,{(y{(k)}_{t+\lambda }^{{i}_{c}}-\tilde{y}{(k)}_{t+\lambda }^{{i}_{c}})}^{2}$. The SL is the difference between the out-of-sample and the in-sample error:

$$S{L}_{t+\lambda }^{{i}_{c}}={L}_{t+\lambda }({\hat{\theta }}_{t}^{{i}_{c}})-\frac{1}{m}\mathop{\sum }\limits_{j=t-m+1}^{t}\,{L}_{j}({\hat{\theta }}_{t}^{{i}_{c}})\,\,{\rm{for}}\,t=m,\ldots ,{T}^{{i}_{c}}-\lambda $$

(2)

To remove short-term fluctuations, a moving-average filter (with size δ) smooths the SL:

$$SLMea{n}_{t}^{{i}_{c}}=\mathop{\sum }\limits_{j=t-(m+\delta )+1}^{t}\,\frac{S{L}_{j}^{{i}_{c}}}{j}\,\,{\rm{for}}\,t=m+\delta ,\ldots ,{T}^{{i}_{c}}$$

(3)

For a given patient i, prior to the clinically annotated onset of disease c, a relatively high $SLMea{n}_{t}^{{i}_{c}}$ suggests putative transitions across dynamical regimes and serves as an early-warning indicator. We consider the maximum of $SLMea{n}^{{i}_{c}}$ at time index ${t}_{max}^{{i}_{c}}$ to denote a critical transition. Fig. 2a illustrates the calculation of $S{L}^{{i}_{c}}$, $SLMea{n}^{{i}_{c}}$ and ${t}_{max}^{{i}_{c}}$. A simulated example using synthetic data is shown in Fig. 3.

Uncertainty in SLMean

Uncertainty in out-of-sample forecasting and in-sample performance adds noise to the precise location of ${t}_{max}^{{i}_{c}}$. Let ${t}_{max(up)}^{{i}_{c}}$ and ${t}_{max(low)}^{{i}_{c}}$, respectively, be the time indices corresponding to the modes of the upper and lower bounds of the 95% prediction interval of SLMean. Our approach is robust if the deviations of ${t}_{max}^{{i}_{c}}$ from ${t}_{max(up)}^{{i}_{c}}$ and ${t}_{max(low)}^{{i}_{c}}$ are close to zero.

Data-sampling strategy with SLMean

Here, we demonstrate a method for sampling data from the critical transition points (derived from SLMean) to differentiate the septic shock cohort from the non-sepsis cohort (see Fig. 2b). We also propose a bootstrap test (based on a random sampling of data) to evaluate whether it outperforms the SL-based approach. Such a data selection step can be seen as a preprocessing step for the machine learning-based techniques being developed to study sepsis (as described in ‘Introduction’). The data sampling step is agnostic to the clinical condition of the patient, i.e., data for each patient are based on SL (see ‘Perturbations in the dynamics’), and in a subsequent step, we used the clinical condition to perform statistical tests.

Specifically, we selected the data at ${t}_{max}^{{i}_{c}}$, i.e., the critical transition points (in the case of multiple ${t}_{max}^{{i}_{c}}$ values, the one closer to the disease-onset was selected), sampled the corresponding data and represented them as an n × v variable matrix ${S}^{c}=[{y}_{{t}_{{\max }}}^{{1}_{c}},\ldots ,{y}_{{t}_{{\max }}}^{{v}_{c}}]$ where c ∈ {0, 1} i.e., non-sepsis and septic shock conditions, and v is the total number of patients. Thereafter, for each variable, a p-value based on Wilcoxon rank-sum test⁵² was calculated, quantifying the significance of differences between the two patient cohorts (as shown in the equation (4)).

$$p=(pval({S}_{1}^{0},{S}_{1}^{1}),\ldots ,pval({S}_{n}^{0},{S}_{n}^{1}))$$

(4)

where pval(.) returns the p-value based on the Wilcoxon rank-sum test. ${S}_{j}^{0}$ and ${S}_{j}^{1}$ denote the j^th row vectors of matrices S⁰ and S¹ matrices, respectively. Furthermore, we performed the Benjamini and Hochberg correction method to adjust the p-values⁵³ accounting for multiple comparisons.

Bootstrapping

Furthermore, a bootstrap test was used to compare the p-values calculated at critical transition points from the p-values that were obtained from random points in our time series. For a randomly selected time index t with its corresponding observation ${y}_{t}^{{i}_{c}}$, where $t\in \mathrm{(1,}\,{T}^{{i}_{c}})$, the t_random p-values were calculated by replacing t_max with t. The test was repeated 1000 times. Bootstrap frequency (BF) denotes the fraction of replications wherein t_max p-values were less than t_random p-values. A high BF value indicates that the SL based approach has an advantage over the random approach. In addition to computing the BF on data randomly sampled from all times, we computed BF on randomly sampled data of septic patients from two arbitrary time intervals, 36 hours and 18 hours before the onset of septic shock. This step allows us to test whether merely randomly sampling data close to the onset time can outperform the SL approach.

Autocorrelation and variance as early-warning signals

In the dynamics of a system, increased temporal autocorrelation and increased variance are hypothesized to be two indicators that the system is approaching a state transition⁷. To evaluate the SL concept, we calculated these two presumed early warning signals and compared the results with those obtained from the SL approach. As these measures are both univariate, to apply them to our multivariate time series data, we formulated them as follows:

$$AC{1}_{t}^{{i}_{c}}=\frac{1}{N}\mathop{\sum }\limits_{k=1}^{N}\,AC{(y{(k)}_{t})}^{{i}_{c}}\,\,{\rm{for}}\,t=m,\ldots ,{T}^{{i}_{c}}$$

(5)

$$VA{R}_{t}^{{i}_{c}}=\frac{1}{N}\mathop{\sum }\limits_{k=1}^{N}\,Var{(y{(k)}_{t})}^{{i}_{c}}\,\,{\rm{for}}\,t=m,\ldots ,{T}^{{i}_{c}}$$

(6)

where AC and AC1 are autocorrelation and variance functions applied on variable y(k) for time indices $t-m+\mathrm{1,}\ldots ,t$. t is the time index, and m is the width of a moving time window. The first coefficient of auto-correlation $AC{1}_{t}^{{i}_{c}}$ and variance $VA{R}_{t}^{{i}_{c}}$ were computed by averaging over N variables. i is the index of a given patient with clinical condition c, and ${T}^{{i}_{c}}$ is the length of the corresponding time series.

Similar to the SL concept, t_max is defined as the time index where the highest value of the early-warning signal occurs (here, the largest value of $AC{1}^{{i}_{c}}$ or $VA{R}^{{i}_{c}}$). P-values and bootstrap frequencies were computed as described in ‘SLMean-based data-sampling strategy’ and ‘Data-sampling strategy with SLMean’.

Software

To support reproducible research, our computational method is available at https://github.com/JRC-COMBINE/SL-MTS.

Results

SLMean as an early-warning indicator

Over a moving time window (m = 36, i.e., 18 hours; e = 3; λ-step-ahead = 1, i.e., 30 minutes; δ = 6, i.e., 3 hours), the $SLMea{n}^{{i}_{c}}$ values (‘Perturbations in the dynamics’), as shown in Fig. 4, were computed. A positive $SLMea{n}_{t}^{{i}_{c}}$ indicates higher out-of-sample error than in-sample error, signaling putative transitions in the underlying dynamics. The componentwise mean vector and associated standard deviation of all septic shock patients, i.e., $SLMea{n}^{{1}_{c}},\ldots ,SLMea{n}^{{N}_{c}}$ (where N is the total number of septic shock patients and c is the septic shock clinical condition), intensified as the moving time window approached the disease onset. For the same cohort of septic shock patients, a slight increase in the componentwise mean vector and associated standard deviation of $VA{R}^{{i}_{c}},\ldots ,VA{R}^{{N}_{c}}$ could be seen, while those of $AC{1}^{{1}_{c}},\ldots ,AC{1}^{{N}_{c}}$ did not show any changes over time. The findings are summarized in Fig. 5.

It should be taken into account that the largest $SLMea{n}_{t}^{{i}_{c}}$ need not necessarily occur exactly at the time of disease onset. For septic shock patients, the location of the time index t_max from the onset time (T) is shown in Fig. 6b. In the majority of our patients’ data, the highest SLMean occurred near septic shock onset (60% of the patients, the signal occurred less than 48 hours prior to onset, as shown in Fig. 6b). However, in some patients, the signal was observed beyond onset time. Possible explanations include a lack of records or a low sampling rate of variables a few days before the onset of septic shock, resulting in a nonsignificant SLMean. The highest SLMean, on average, occurred 46 hours (median of 35.6 hours) prior to the appearance of septic shock symptoms. In comparison, TREWScore²³ identified septic patients at a median of 28.2 hours before onset.

While the median time of the peak SLMean occurred at 35.6 hours before the onset of septic shock, visual inspection of the mean and standard deviation of SLMean indicates an upward trend starting from approximately 24 hours (Figs 5a and 6a). The explanation for the apparent deviation from the baseline is that the highest $SLMea{n}_{t}^{{i}_{c}}$ values that occurred closer to onset were greater in magnitude.

Furthermore, we determined the uncertainty in SL calculation using prediction intervals (as described in ‘Uncertainty in SLMean’). Our results show negligible deviation in t_max i.e., the median deviation is 0, and the interquartile range (IQR) is 5.4 hours.

SLMean-based data-sampling strategy

We compared the p-values for data sampled at t_max (i.e., critical transition point) to those obtained via random sampling (see equation (4) and ‘Data-sampling strategy with SLMean’). The same procedure was implemented for AC1 and VAR, and the bootstrap test was performed for all time indices. The bootstrap frequencies were denoted as BF (SL), BF (AC1) and BF (VAR), respectively (see Table 2). The different BF computations test the association of the bootstrap frequency values of some variables with high SLMean, AC1 and VAR. In 14 out of 19 variables, BF (SL) demonstrates superior results. In the next step, in addition to all the time indices, the bootstrap test was performed for time-windows of 18 and 36 hours before the onset of septic shock; the bootstrap frequencies are represented as BF (Full), BF (18 hours), and BF (36 hours). Fig. 7a plots BF (Full) against p-values computed at t_random and at high SLMean (i.e., t_max). Most of the variables show a good BF with high log-transformed p-values when sampled at large SLMean, particularly in the case of variables such as blood pressures, temperature and SpO2, where random sampling leads to poor p-values. As the random sampling strategy changed to either to 36 or 18 hours in Fig. 7b, BF reduced for six variables (WBC, diastolic blood pressure, Hemoglobin, SpO2, creatinine, and BUN), but it was preserved for nine variables (respiratory rate, heart rate, potassium, mean blood pressure, hematocrit, shock index, temperature, BUN-creatinine, and systolic blood pressure), i.e., the differences among BF (Full), BF (36 hours), and BF (18 hours) were small. Four variables, bicarbonate, urine output, platelets and sodium, had low BF (Full), BF (36 hours), and BF (18 hours).

Table 2 A statistical significance test (see ‘Data-sampling strategy with SLMean’) was performed to test whether the values of clinical variables at largest SLMean, AC1 and VAR were able to statistically differentiate septic shock patients from non-sepsis patients, and a bootstrap test was performed to calculate the fraction of replications where the t_max p-values were less than the p-values from a random sampling.

Full size table

Robustness of the SSM model

We assessed the robustness of our method to perturbations in the model parameters. We changed the length of the moving time window, m ∈ (24, 30), and the number of trends in the SSM model (e ∈ (4, 5)) and compared the changes in the t_max with respect to the reference setting, i.e., m = 36 and e = 3. The chosen values of e are based on the assumptions described in ‘State space model’ (e = 3 and $e\ll n$). The length of the moving time window was selected with regard to the average variables sampling rate (see Table 1, as well as the length of hospitalization in the ICU (see ‘Data source’). The differences in t_max due to the perturbations are summarized in online Supplementary Fig. S2. The zero median of such differences confirmed the robustness of our approach. Due to multiple similar high values SLMean in some patients, alteration of model parameters led to different t_max values in these patients, which caused the outliers in online Supplementary Fig. S2.

Discussions

Healthcare can benefit from the analysis of continuously monitored health data, which are rapidly growing in quantity due to the increasing availability of long time series collected either by wearables or by monitoring systems such as those established in the ICU. However, significant challenges remain unresolved. A major drawback is the restriction of data availability to variables that are easy to collect by noninvasive sensors. These variables provide only correlated surrogates of the primary disease-driving processes. Hence, sensor signals are rarely specific on their own; advanced computational processing is typically necessary to identify relevant signals to improve therapy.

Focusing data analysis on the prediction and identification of critical transitions, i.e., instabilities in patient data, may complement established scoring methods in the classification of stable states. Although critical transitions differ qualitatively from scores in classifying stable states, the former method provides an independent assessment of health status. Because critical transitions are simply identified through the evolution of individual longitudinal time series, in contrast to established scores based on absolute variable values, markers for the detection of critical transitions are relatively robust to normalization and data standardization issues.

To identify such critical transitions in ICU patients, we applied the concept of surprise loss (SL), which was originally developed for determining instability in a model’s forecasting ability in econometrics. We changed the model in the original SL approach to a multivariate SSM model to model two sources of variability, namely, the hidden underlying biological processes and the observables. Despite a multitude of interventions in the ICU, our moving average SL, SLMean, showed, on average, an increasing signal approximately 24 hours before the expert-annotated onset of septic shock (see Fig. 6a), thereby indicating its applicability as an early-warning indicator. We utilized such an indicator to devise a critical-transition-based data-sampling strategy for discriminating septic shock from non-sepsis patients. Additionally, through a bootstrap test (quantified through BF(Full)), the benefit of our method is shown with respect to a random data selection strategy (as summarized in Table 2 and Fig. 7a). Except for bicarbonate, urine output, platelets and sodium, the SL-based approach results in better p-values and BF(Full) than the random strategy. In addition, we selectively sampled random data from 36 hours and 18 hours before the septic onset to compute BF(36 h) and BF(18 h), respectively (see Fig. 7b and Supplementary Table S2). Such selective sampling evaluates whether merely sampling data close to the onset time of septic shock outperforms our method in distinguishing sepsis from non-sepsis. These new BF values seem to be well-preserved for most variables that have correspondingly high BF(Full). Therefore, an SL-informed sampling strategy may improve the quality of patient classification and eventually enable the reduction of sample sizes.

Moreover, from a systems theory point of view, mechanisms that control the system in homeostasis begin to collapse around a critical transition or tipping point. Consequently, variables that are under tight control within stable states may be more sensitive to systemic variability around an unstable point. Our data analysis supports this hypothesis (see Fig. 7a): some variables under tight control, e.g., blood pressure and body temperature, showed significant improvement in p-values compared to random sampling. We compared our method with two other univariate early-warning measures for critical transitions in complex systems: temporal autocorrelation and variance^7,15,16,35. As shown in Fig. 5, our method outperformed these estimators as an early-warning indicator for septic shock patients. Similarly, the p-values and BF of our method were also more favorable than those of the other methods (Table 2).

Conceptually, SL computation is based on the premise that the underlying system has a stable stationary state and that all observed deviations can be explained as responses to stochastic perturbations. The permissible amount of deviation is controlled by the system’s robustness at the time of computation. As a result, SL-based analysis can forewarn of a “loss of stability” even before the underlying system has changed its state. In that sense, SL provides indicators similar to those from the analysis of critical slowing down³⁵. One drawback is that local loss of robustness may neither result in a transition to another state nor indicate a new state. SL-based warning systems, in isolation, may thus lead to false alarms and could be improved by combining them with ML classifiers. Additionally, moving-window length restricts the capability of the SL-based warning system, and analysis can only be performed only when sufficient data have been collected. Hence, further evaluations must be performed towards utilization of SL-based analysis in practice. As a high SL is not specific and can be generated by any sudden event affecting the data, either errors in the monitoring system or health-related covariates, a robust characterization of the standard SL patterns characterizing control states is crucial. As sudden, high SL peaks can arise from sudden monitoring aberrations, we expect that a threshold-based alarm system might result in an unacceptable false positive rate. Therefore, emphasis should be placed on the characterization of SL patterns that are representative of the control state, eventually for each individual patient, followed by an AI-based pattern classifier. Effectively, this method will result in significant calibration times to setup the alarm system for each patient, such that effective training procedures for the learning of the control state patterns might be essential for transfer to clinical applications.

References

Mojtahedi, M. et al. Cell fate decision as high-dimensional critical state transition. PLOS Biol. 14, 1–28, https://doi.org/10.1371/journal.pbio.2000640 (2016).
Article CAS Google Scholar
Samal, S. S. et al. Metastable regimes and tipping points of biochemical networks with potential applications in precision medicine. bioRxiv, https://doi.org/10.1101/466714 (2018).
Trefois, C., Antony, P. M., Goncalves, J., Skupin, A. & Balling, R. Critical transitions in chronic disease: transferring concepts from ecology to systems medicine. Curr. Opin. Biotechnol. 34, 48–55, https://doi.org/10.1016/j.copbio.2014.11.020 (2015).
Article CAS PubMed Google Scholar
Thomas, R. & d’Ari, R. Biological feedback (CRC press, 1990).
Waddington, C. H. The strategy of the genes. London: Allen 86 (1957).
Arnold, V. I. Singularities, Bifurcations, and Catastrophes. In Catastrophe Theory, 1–2, https://doi.org/10.1007/978-3-642-58124-3 (Springer Berlin Heidelberg, Berlin, Heidelberg, 1992).
Book Google Scholar
Scheffer, M. et al. Early-warning signals for critical transitions. Nature 461, 53–59, https://doi.org/10.1038/nature08227 (2009).
Article ADS CAS PubMed Google Scholar
Guttal, V. & Jayaprakash, C. Changing skewness: an early warning signal of regime shifts in ecosystems. Ecol. Lett. 11, 450–460, https://doi.org/10.1111/j.1461-0248.2008.01160.x (2008).
Article PubMed Google Scholar
Dakos, V. et al. Slowing down as an early warning signal for abrupt climate change. Proc. Natl. Acad. Sci. 105, 14308–14312, https://doi.org/10.1073/pnas.0802430105 (2008).
Article ADS PubMed PubMed Central Google Scholar
Dakos, V., van Nes, E. H., Donangelo, R., Fort, H. & Scheffer, M. Spatial correlation as leading indicator of catastrophic shifts. Theor. Ecol. 3, 163–174, https://doi.org/10.1007/s12080-009-0060-6 (2010).
Article Google Scholar
Carpenter, S. R. & Brock, W. A. Rising variance: a leading indicator of ecological transition: Variance and ecological transition. Ecol. Lett. 9, 311–318, https://doi.org/10.1111/j.1461-0248.2005.00877.x (2006).
Article CAS PubMed Google Scholar
Chen, L., Liu, R., Liu, Z.-P., Li, M. & Aihara, K. Detecting early-warning signals for sudden deterioration of complex diseases by dynamical network biomarkers. Sci. reports 2, 342 (2012).
Article Google Scholar
Lenton, T. M., Livina, V. N., Dakos, V., Van Nes, E. H. & Scheffer, M. Early warning of climate tipping points from critical slowing down: Comparing methods to improve robustness. Philos. Transactions Royal Soc. A: Math. Phys. Eng. Sci. 370, 1185–1204, https://doi.org/10.1098/rsta.2011.0304 (2012).
Article ADS CAS Google Scholar
Chen, P., Liu, R., Li, Y. & Chen, L. Detecting critical state before phase transition of complex biological systems by hidden Markov model. Bioinformatics 32, 2143–2150, https://doi.org/10.1093/bioinformatics/btw154 (2016).
Article CAS PubMed Google Scholar
Gijzel, S. M. W. et al. Dynamical resilience indicators in time series of self-rated health correspond to frailty levels in older adults. 72, 991–996, https://doi.org/10.1093/gerona/glx065 (2017).
Article Google Scholar
Gijzel, S. M. W. et al. Dynamical indicators of resilience in postural balance time series are related to successful aging in high-functioning older adults, https://doi.org/10.1093/gerona/gly170 (2018).
Article Google Scholar
Liu, R., Chen, P., Aihara, K. & Chen, L. Identifying early-warning signals of critical transitions with strong noise by dynamical network markers. Sci. Reports 5, https://doi.org/10.1038/srep17501 (2015).
Singer, M. et al. The Third International Consensus Definitions for Sepsis and Septic Shock (Sepsis-3). JAMA 315, 801, https://doi.org/10.1001/jama.2016.0287 (2016).
Article CAS PubMed PubMed Central Google Scholar
Fleischmann, C. et al. Epidemiology of sepsis in germany: Incidence, mortality and associated costs of care 2007-2013. Intensive Care Medicine Exp. 3, A50, https://doi.org/10.1186/2197-425X-3-S1-A50 (2015).
Article Google Scholar
Bhattacharjee, P., Edelson, D. P. & Churpek, M. M. Identifying Patients With Sepsis on the Hospital Wards. Chest 151, 898–907, https://doi.org/10.1016/j.chest.2016.06.020 (2017).
Article PubMed Google Scholar
Gaieski, D. F. et al. Impact of time to antibiotics on survival in patients with severe sepsis or septic shock in whom early goal-directed therapy was initiated in the emergency department*. Critical Care Medicine 38, 1045–1053, https://doi.org/10.1097/CCM.0b013e3181cc4824 (2010).
Article PubMed Google Scholar
Kumar, A. et al. Duration of hypotension before initiation of effective antimicrobial therapy is the critical determinant of survival in human septic shock*. Critical Care Medicine 34, 1589–1596, https://doi.org/10.1097/01.CCM.0000217961.75225.E9 (2006).
Article PubMed Google Scholar
Henry, K. E., Hager, D. N., Pronovost, P. J. & Saria, S. A targeted real-time early warning score (TREWScore) for septic shock. Sci. Transl. Medicine 7, 299ra122–299ra122, https://doi.org/10.1126/scitranslmed.aab3719 (2015).
Article PubMed Google Scholar
Mitchell, S. et al. Optimization of sepsis risk assessment for ward patients. 107–112, https://doi.org/10.1109/SIEDS.2016.7489280 (IEEE, 2016).
Carrara, M., Baselli, G. & Ferrario, M. Mortality prediction in septic shock patients: Towards new personalized models in critical care. 2792–2795, https://doi.org/10.1109/EMBC.2015.7318971 (IEEE, 2015).
Tang, C. H. H. et al. Non-invasive classification of severe sepsis and systemic inflammatory response syndrome using a nonlinear support vector machine: a preliminary study. Physiol. Meas. 31, 775–793, https://doi.org/10.1088/0967-3334/31/6/004 (2010).
Article PubMed Google Scholar
Rothman, M. et al. Sepsis as 2 problems: Identifying sepsis at admission and predicting onset in the hospital using an electronic medical record–based acuity score. J. Critical Care 38, 237–244, https://doi.org/10.1016/j.jcrc.2016.11.037 (2017).
Article Google Scholar
Ghosh, S., Li, J., Cao, L. & Ramamohanarao, K. Septic shock prediction for ICU patients via coupled HMM walking on sequential contrast patterns. J. Biomed. Informatics 66, 19–31, https://doi.org/10.1016/j.jbi.2016.12.010 (2017).
Article Google Scholar
Horng, S. et al. Creating an automated trigger for sepsis clinical decision support at emergency department triage using machine learning. Plos One 12, e0174708, https://doi.org/10.1371/journal.pone.0174708 (2017).
Article CAS PubMed PubMed Central Google Scholar
Cook, R. D. Detection of influential observation in linear regression. Technometrics 19, 15–18, https://doi.org/10.1080/00401706.1977.10489493 (1977).
Article MathSciNet MATH Google Scholar
Guyon, I. & Elisseeff, A. An introduction to variable and feature selection. J. machine learning research 3, 1157–1182 (2003).
MATH Google Scholar
Giacomini, R. & Rossi, B. Detecting and predicting forecast breakdowns. Rev. Econ. Stud. 76, 669–705, https://doi.org/10.1111/j.1467-937X.2009.00545.x (2009).
Article MathSciNet MATH Google Scholar
Clements, M. & Hendry, D. Forecasting economic time series (Cambridge University Press, 1998).
Hendry, D. F. & Doornik, J. A. The implications for econometric modelling of forecast failure. Scott. J. Polit. Econ. 44, 437–461, https://doi.org/10.1111/1467-9485.t01-1-00065 (1997).
Article Google Scholar
Scheffer, M. et al. Quantifying resilience of humans and other animals. 115, 11883–11890, https://doi.org/10.1073/pnas.1810630115 (2018).
Article CAS Google Scholar
Johnson, A. E. W. et al. MIMIC-III, a freely accessible critical care database. Sci. Data 3, sdata201635, https://doi.org/10.1038/sdata.2016.35 (2016).
Abraham, E. New definitions for sepsis and septic shock: continuing evolution but with much still to be done. Jama 315, 757–759 (2016).
Article CAS PubMed Google Scholar
Bone, R. C. et al. Definitions for sepsis and organ failure and guidelines for the use of innovative therapies in sepsis. Chest 101, 1644–1655 (1992).
Article CAS PubMed Google Scholar
Levy, M. M. et al. 2001 SCCM/ESICM/ACCP/ATS/SIS International Sepsis Definitions Conference. Critical Care Medicine 31, 1250–1256, https://doi.org/10.1097/01.CCM.0000050454.01978.3B (2003).
Article PubMed Google Scholar
Organization, W. H. ICD-10: The ICD-10 Classification of Mental and Behavioural Disorders: diagnostic criteria for research. In ICD-10: the ICD-10 classification of mental and behavioural disorders: diagnostic criteria for research (1993).
Statistics (US), N. C. f. H. The International Classification of Diseases: 9th Revision, Clinical Modification: ICD-9-CM (1991).
Vincent, J. L. et al. Use of the SOFA score to assess the incidence of organ dysfunction/failure in intensive care units: results of a multicenter, prospective study. Critical care medicine 26, 1793–1800 (1998).
Article CAS PubMed Google Scholar
Vincent, J. L. et al. The SOFA (Sepsis-related Organ Failure Assessment) score to describe organ dysfunction/failure (Springer, 1996).
Johnson, A. E. W. et al. A Comparative Analysis of Sepsis Identification Methods in an Electronic Database. Critical care medicine (2018).
Stineman, R. W. A Consistently Well Behaved Method of Interpolation. Creat. Comput. 6, 54–57 (1980).
Google Scholar
Harvey, A. C. Forecasting, structural time series models and the Kalman filter (Cambridge university press, 1990).
Auger-Méthé, M. et al. State-space models’ dirty little secrets: even simple linear Gaussian models can have estimation problems. Sci. Reports 6, 26677, https://doi.org/10.1038/srep26677 (2016).
Article ADS CAS Google Scholar
Holmes, E., Ward, E. & Wills, K. MARSS: Multivariate Autoregressive State-Space Modeling (2018).
Holmes, E. E., Ward, E. J. & Wills, K. MARSS: Multivariate autoregressive state-space models for analyzing time-series data. The R J. 4, 30 (2012).
Article Google Scholar
Andrews, D. W. K. Tests for parameter instability and structural change with unknown change point. Econometrica 61, 821–856 (1993).
Article MathSciNet Google Scholar
Chow, G. C. Tests of equality between sets of coefficients in two linear regressions. Econometrica 28, 591–605 (1960).
Article MathSciNet Google Scholar
Hollander, M., A. Wolfe, D. & Chicken, E. Nonparametric Statistical Methods: Hollander/Nonparametric Statistical Methods (John Wiley & Sons, Inc., Hoboken, NJ, USA, 2015).
Benjamini, Y. & Hochberg, Y. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. J. Royal Stat. Soc. Ser. B (Methodological) 57, 289–300 (1995).
MathSciNet MATH Google Scholar

Download references

Acknowledgements

The computing resources were granted by RWTH Aachen University under project rwth0260. S.S.S. was supported by funding from CompSE profile area, RWTH Aachen University. We wish to thank the anonymous reviewers whose constructive comments helped to improve the manuscript.

Author information

Satya S. Samal
Present address: BASF SE, Carl-Bosch-Strasse 38, 67056, Ludwigshafen am Rhein, Germany
Pejman F. Ghalati and Satya S. Samal contributed equally.

Authors and Affiliations

Joint Research Center for Computational Biomedicine, RWTH Aachen University, 52074, Aachen, Germany
Pejman F. Ghalati, Satya S. Samal, Jayesh S. Bhat & Andreas Schuppert
Klinik für Operative Intensivmedizin und Intermediate Care, Universitätsklinikum Aachen, 52074, Aachen, Germany
Robert Deisz & Gernot Marx

Authors

Pejman F. Ghalati
View author publications
You can also search for this author in PubMed Google Scholar
Satya S. Samal
View author publications
You can also search for this author in PubMed Google Scholar
Jayesh S. Bhat
View author publications
You can also search for this author in PubMed Google Scholar
Robert Deisz
View author publications
You can also search for this author in PubMed Google Scholar
Gernot Marx
View author publications
You can also search for this author in PubMed Google Scholar
Andreas Schuppert
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

P.F.G. and S.S.S. developed the idea, conducted the research, and implemented the algorithms. J.S.B. helped in the preparation of the data and in proofreading of the article. R.D. and G.M. provided the clinical insights and interpreted the findings. A.S. supervised and supported the research project. All authors have reviewed the manuscript.

Corresponding author

Correspondence to Andreas Schuppert.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ghalati, P.F., Samal, S.S., Bhat, J.S. et al. Critical Transitions in Intensive Care Units: A Sepsis Case Study. Sci Rep 9, 12888 (2019). https://doi.org/10.1038/s41598-019-49006-2

Download citation

Received: 21 February 2019
Accepted: 14 August 2019
Published: 09 September 2019
DOI: https://doi.org/10.1038/s41598-019-49006-2

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.