A multimodal stacked ensemble model for cardiac output prediction utilizing cardiorespiratory interactions during general anesthesia

Dervishi, Albion

doi:10.1038/s41598-024-57971-6

Download PDF

Article
Open access
Published: 29 March 2024

A multimodal stacked ensemble model for cardiac output prediction utilizing cardiorespiratory interactions during general anesthesia

Albion Dervishi¹

Scientific Reports volume 14, Article number: 7478 (2024) Cite this article

954 Accesses
2 Altmetric
Metrics details

Subjects

Abstract

This study examined the possibility of estimating cardiac output (CO) using a multimodal stacking model that utilizes cardiopulmonary interactions during general anesthesia and outlined a retrospective application of machine learning regression model to a pre-collected dataset. The data of 469 adult patients (obtained from VitalDB) with normal pulmonary function tests who underwent general anesthesia were analyzed. The hemodynamic data in this study included non-invasive blood pressure, plethysmographic heart rate, and SpO₂. CO was recorded using Vigileo and EV1000 (pulse contour technique devices). Respiratory data included mechanical ventilation parameters and end-tidal CO₂ levels. A generalized linear regression model was used as the metalearner for the multimodal stacking ensemble method. Random forest, generalized linear regression, gradient boosting machine, and XGBoost were used as base learners. A Bland–Altman plot revealed that the multimodal stacked ensemble model for CO prediction from 327 patients had a bias of − 0.001 L/min and − 0.271% when calculating the percentage of difference using the EV1000 device. Agreement of model CO prediction and measured Vigileo CO in 142 patients reported a bias of − 0.01 and − 0.333%. Overall, this model predicts CO compared to data obtained by the pulse contour technique CO monitors with good agreement.

Automated prediction of extubation success in extremely preterm infants: the APEX multicenter study

Article 29 July 2022

A comparative study of explainable ensemble learning and logistic regression for predicting in-hospital mortality in the emergency department

Article Open access 10 February 2024

A retrospective study of mortality for perioperative cardiac arrests toward a personalized treatment

Article Open access 12 August 2022

Introduction

Mechanical ventilation (MV) has a predictable impact on circulation^1,2. Cardiorespiratory interactions are clinically important because MV can lead to cardiac instability³. MV typically uses positive airway pressure, thereby increasing intrathoracic pressure (ITP), causing a reduction in venous return, and increasing pulmonary vascular resistance, which can decrease preload and subsequently reduce cardiac output (CO). However, increased ITP can cause a decrease in the afterload on the heart, leading to increased stroke volume and CO.

In addition, excessive tidal volume and lung hyperinflation caused by overstimulation of sensory nerve endings located within the alveolar walls can lead to reflex bradycardia and depression of the somatic nervous system⁴.

Variations in arterial pulse and systolic pressure in mechanically ventilated patients with adjusted tidal volumes can predict fluid responsiveness during acute circulatory failure related to sepsis⁵. A decrease in CO₂ concentration at the end-tidal concentration (EtCO₂) in humans and animals correlates with a reduction in pulmonary blood flow/CO^6,7. This relationship is significant and is currently implemented in anesthesia monitoring for non-invasive and minimally invasive breath-by-breath CO monitoring in patients ventilated during anesthesia and critical care⁸.

Presently, various methods for monitoring cardiovascular systems are available, including non-invasive, minimally invasive, and invasive techniques for measuring CO. Among these, thermodilution (TD) is considered the gold standard method, and pulse contour analysis is widely used^9,10. A non-calibrated pulse pressure analysis device has been demonstrated to be clinically and statistically acceptable under hypo- and normodynamic conditions¹¹.

CO is one of the most challenging hemodynamic parameters to assess in unstable patients. Even with a calibrated pulse contour hemodynamic monitoring system (VolumeView/EV1000), considerable overestimation of hemodynamic parameters has been reported when using a peripherally inserted central catheter from the brachial vein during calibration with temperature variation (ΔT) in comparison with a centrally inserted venous catheter¹².

Moreover, more than a dozen non-invasive methods have been proposed and developed to estimate CO. The simplest of these methods involves calculating CO by multiplying the stroke volume (SV) by the heart rate (HR), where SV is obtained by multiplying the pulse pressure (systolic blood pressure (SBP)–diastolic blood pressure (DBP)) by a constant value (k = 2). This method has been evaluated and observed to have a moderate correlation between the measured and estimated CO (r = 0.60, p < 0.001)¹³. Furthermore, machine learning algorithms have been employed in animal models to predict CO accurately by utilizing waveform arterial blood pressure and HR, with a difference of − 0.13 (0.69 L/min) between the sheep’s pulmonary arterial blood flow using a transit time Doppler flow probe and predicted CO¹⁴.

Cardiorespiratory interaction data comprise heterogeneous information from the patient monitor, anesthesia machine, and CO monitor. Because of this diversity, employing a single model is impractical for comprehensively learning all facets of the data. Therefore, the rationale behind the use of multimodal stacking ensembles stems from their success in integrating multiple information sources for complex decision making in various medical machine learning tasks^15,16.

Recently, there has been growing interest in applying machine learning algorithms to estimate CO, particularly from arterial pressure waveforms^14,17,18. However, classical lumped parameter models, such as the Windkessel and Liljestrand–Zender models, suggest that approximate CO can be derived from basic monitoring data^13,19. Therefore, we chose to incorporate cardiorespiratory interactions into the machine-learning prediction of CO based on numerical data. To the best of our knowledge, this is the first study to use this approach.

Results

Among the 6388 patients in the VitalDB Dataset, which measured hemodynamics including CO from the Vigileo and EV1000 devices and respiratory monitoring data, 722 were eligible for the study (Fig. 1). Patients < 18 years old were excluded (n = 8). Data were selected from the beginning to the end of surgery to ensure hands-free, automatic, and constant ventilation. Additionally, the absence of lung disease in patients undergoing general anesthesia could be achieved by selecting patients with normal pulmonary function testing from their clinical information. Consequently, exclusion criteria were applied to patients with abnormal pulmonary function tests (n = 110).

Participants with unsynchronized parameter measurements and an absence of intraoperative NIBP measurements were excluded (n = 135). After data preprocessing, 469 patients (EV1000, n = 327; Vigileo, n = 142) met the data requirements for further analysis. A summary of the patients and their clinical characteristics is presented in Supplementary Tables S1–3.

Results of the base and multimodal stacked ensemble model regression

We evaluated the individual base learners and multimodal stacking ensemble regressions on three data subsets. We first trained and validated the hemodynamic response to cardiopulmonary interactions during MV using parameters from the hemodynamic and respiratory subsets (NIBP, HR, MV, SpO₂, and EtCO₂).

The hemodynamic data subsets (NIBP and HR) were then used for training and validation of the base models to predict CO from arterial blood pressure and heart rate.

Finally, respiratory data subsets (MV, EtCO₂, and SpO₂) were used for the training and validation of the base models and to calculate the hemodynamic effects of MV.

Table 1 presents the best results of the four base models, which were derived from the regression performance metrics of GLM, RF, GBM, XGBoost, and multimodal stacked ensemble models. For both CO monitoring devices, multimodal stacking outperformed all its base models in terms of the MSE, RMSE, and MAE, measuring 0.096, 0.31, and 0.186 for EV1000 and 0.057, 0.239, and 0.139 for Vigileo, respectively. In addition, based on MAE, average errors were evaluated. Compared to base models, RMSE was more sensitive to significant errors and multimodal stacking models predicted CO more accurately.

Table 1 Performance of the base and multimodal stacked ensemble models.

Full size table

Multimodal stacked ensemble model CO prediction vs. measured CO from the arterial waveform analysis device

Figure 2 and Table 2 present the baseline agreement between the multimodal stacked ensemble model for CO prediction and the CO measured using the EV1000 device. The difference between the methods was r = 0.985, and R² was 0.97 (Fig. 2a). Bland–Altman analysis revealed that the mean difference between measurements and prediction was − 0.001 L/min (± 1.96 SD, 0.611, and − 0.614 L/min; Fig. 2b). The proportional mean difference was − 0.271% (± 1.96 SD, 12.94%, and − 13.488%; Fig. 2c).

Table 2 Bland–Altman analysis.

Full size table

The agreement between the multimodal stacked ensemble model prediction of CO and the CO measured using Vigileo was r = 0.987, and R² was 0.974 (Fig. 2d). The overall mean bias for agreement in CO was − 0.01 (± 1.96 SD, 0.464, and − 0.477 L/min; Fig. 2e). The proportional mean difference was − 0.333% (± 1.96 SD, 9.924%, and − 10.59%; Fig. 2f).

Discussion

In this study, we proposed a multimodal stacking ensemble that combines data from non-invasive cardiovascular monitoring and MV parameters, including SpO₂ and EtCO₂. A fundamental principle of the proposed model is that stacking makes the prediction accuracy better than that of a single machine learning algorithm, and stacking several algorithms significantly improves the prediction accuracy. We demonstrate that the multimodal stacked ensemble model predicts accurate and valid CO values with marginal bias and a narrow CO limit of agreement compared with those obtained using pulse contour technique devices.

Ensemble stacking regression leverages multimodal information gathered from anesthesia machine and patient monitors, deriving benefits from the RF, GBM, and XGBoost base models. It effectively captures the nonlinear relationships in the interplay between the heart and lungs during positive-pressure ventilation. Non-linear interactions of cardiopulmonary features may explain why GLM base models exhibit inferior performance compared with other base models in hemodynamic and respiratory data. An additional advantage of ensemble stacking regression is the interpretability of the final predictions obtained using the GLM metalearner. Furthermore, it demonstrates robustness by harnessing the strengths of the multiple base models.

The Bland–Altman plot is widely recognized as the standard statistical method for assessing the agreement between two consecutive measurements of the same clinical variable²¹. When using a clinical CO measurement device, Bland–Altman plots do not indicate whether the LoAs are acceptable²². For example, an agreement limitation of ± 1 L/min may not be acceptable in patients with low CO syndrome. Additionally, our results include percentage difference plots demonstrating that multimodal stacked ensemble models accurately predict CO, with predictions falling within the acceptable clinical criteria (± 30%) of the proportional mean difference when compared those obtained using the Vigileo and EV1000 devices.

In previous studies the calibrated pulse wave analysis device EV1000 has proven to be accurate and consistent, and was thus used for our reference CO measurement. The results showed good agreement and interchangeability with TD CO measurement, with a bias of − 0.07 L/min, LoA of 2.0 L/min, and a percentage of 29%²³. In addition, the uncalibrated FloTrac/Vigileo provides clinically acceptable accuracy under stable hemodynamic conditions, with an average error below 30% for CO compared with that obtained via TD¹¹. However, severe sepsis and septic shock uncalibrated FloTrac/Vigileo vs. TD revealed no clinically acceptable tracking capability with a bias of − 0.86 L/min, LoA of − 4.48 to 2.77 L/min, and a percentage error of 48%²⁴.

Our study was based entirely on non-cardiac surgery. Accordingly, NIBP was selected because it is a standard measurement for patients with ASA I and II and for intermediate-risk surgery. In addition, NIBP appears to be in acceptable agreement with invasively measured BP in patients with cardiogenic shock²⁵, MV, and arrhythmia²⁶. However, NIBP is not always well calibrated with invasive BP measurement, particularly in hypothermia and pronounced hypotension²⁵. Although invasive BP, known as beat-by-beat measurement, is considered the gold standard method of diagnosis, NIBP is associated with fewer complications, particularly catheter-associated artery pseudoaneurysms, occlusions, and infections²⁷. Occasionally, a measurement can be inaccurate owing to kinking or damping of the arterial line.

The HR was extracted from finger photoplethysmography and may represent acceptable accuracy based on electrocardiography (ECG) during normal breathing. Photoplethysmography and ECG-derived heart rates can differ moderately, and photoplethysmography shows an advantage in monitoring changes in ITP caused by ventilation, sleep apnea, and even changes in respiratory rate during deep breathing^28,29.

Using the respiratory rate based on capnography, the expiratory tidal volume, and the expiratory Vm enabled us to obtain the exact delivered volume per breathing cycle recorded in the anesthesia machine (Fig. 6a–c). Noteworthy differences between the set and delivered tidal volumes have been demonstrated in several clinical situations, such as patient lung size, lung compliance, airway resistance, and maintenance of spontaneous breathing during general anesthesia through invasively assisted spontaneous ventilation^30,31.

Visualizing cardiopulmonary interactions and variable importance in a multimodal stacked ensemble model

Providing decision support using a functional hemodynamic machine learning model based on the complex relationship between the heart and lungs during general anesthesia should be understood by the medical environment. The predictability of the model was quantified in our work using partial dependence plots (PDPs)³², model parameter importance, and interaction variables³³.

The symmetric matrix, derived from the calculation of variable importance and interactions using the RF model, was utilized to visualize the interaction variables in Table Fig. 3c, importance variables in Fig. 3b, and to construct a network graph in Fig. 3a. Variable importance is assessed exclusively based on changes in MSE. In difference, variable interactions are evaluated using the square root of the mean unnormalized version of the H-statistic, yielding a value on a scale of 0 to 1. This approach reduces the identification of spurious interactions and presents results by quantifying changes in the RMSE, which are measured on the same scale as CO in L/min. An RF model incorporating NIBP/HR/MV/SpO2/EtCO2 and CO measured by the Vigileo device was chosen to visualize the interaction and importance variables because it displayed the highest performance, with an R2 of 0.973 and an MSE of 0.074 compared to other base models.

All demographic, hemodynamic, and respiratory parameters displayed interactions to varying degrees with a range of H-statistic values (Fig. 3a and c). Hence, these plots facilitate the interpretation of cardiopulmonary interactions, particularly concerning total interactions and interactions between pairs of features, where one feature remains constant while others change, thereby influencing the accuracy of the cardiac output prediction. Demographic and hemodynamic variables, specifically weight and HR, were identified as the most important interactions, exhibiting an H-statistic value of 0.091. This finding suggests that an increase in the accuracy of the CO prediction corresponds to a reduction in the RMSE of 0.091 L/min. The constant pairs variable, HR, demonstrated the strongest reciprocal interactions with age (H-statistic = 0.058), NIBP-SBP (H-statistic = 0.045), height (H-statistic = 0.07), and EtCO2 (H-statistic = 0.056). The six variables that contributed the most to the prediction of CO in the RF model were HR, age, height, weight, NIBP-SBP, and minute volume (Fig. 3a and b).

The results of our study were consistent with well-established data demonstrating that CO levels decrease with age by approximately 1% per year after the third decade (Fig. 4a). Age-related decline in the stroke index is accompanied by decreased body size and HR, which reduces CO³⁴. We found the exact relationship between body size and CO in a straight-line regression, as observed in the last century³⁵ (Fig. 4c,d). According to our findings, in females, one-way PDPs from the RF, GBM, and XGBoost models showed a decrease in CO of approximately 10% compared to those in males during intraoperative measurements. However, this difference was smaller than the 22% difference reported during the resting state³⁶ (Fig. 4b).

HR is crucial to determining the diastolic filling time, influencing the SV via the Frank–Starling mechanism. For cardiopulmonary interactions during MV, venous return can be reduced, which can further compromise diastolic filling, particularly at high heart rates. Our study revealed a linear relationship between CO and HR up to 90/min, where deceleration began (Fig. 4e). Early curve deceleration is well documented in impaired right heart filling³⁷. However, here, this may have been influenced by factors, such as autonomic nervous system activity, blood volume, and heart contractility, which were beyond the scope of this study.

The relationships between SBP, DBP, and CO during general anesthesia are complex and dynamic. In our study, we observed an increase in SBP corresponding to an increase in CO of up to 120 mmHg following the onset of the deceleration curve (Fig. 4f). The decreased CO level during high intraoperative SBP may be caused by increased vascular resistance, stiffened large arteries³⁸, and reduced SV owing to elevated afterload. Our study demonstrated a decrease in DBP with a marginal increase in CO (Fig. 5a). An increase in pulse pressure might elucidate the observed increase in CO. An increase in SV owing to volume substitution results in increased CO, causing an increase in pulse pressure. Cardiopulmonary interactions and additional interventions such as vasopressor administration or adjustments to ventilator settings may play a substantial role. Additionally, the nonlinear relationship between pulse pressure, cardiac index (CI), and deceleration curve starting at a CI of 3 L/min/m² has been well documented³⁹.

One-way PDPs revealed an inverse relationship between CO and airway pressure (Fig. 5e). A decrease in SV and venous return is the primary mechanism by which increasing airway pressure reduces CO. The application of airway pressure levels at 10, 20, and 30 cm H₂O led to a variation in the CI between + 6% and − 43%, which was associated with corresponding changes in the SV index (p < 0001, r2 = 0.89)⁴⁰. Our findings align with those of earlier studies, as they indicated an increase in airway pressure during lung inflation and a reduction in CO at a rate of 0.5 L/min per 10 mbar increase in PIP.

PEEP increases ITP during the entire respiratory cycle to restore normal end-expiratory lung volume during MV. Increasing the PEEP levels allowed for greater lung expansion. PEEP during MV may also displace blood from the pulmonary circulation, increase mean systemic pressure, reduce venous return, and decrease CO and tissue perfusion⁴¹. Our model exhibits a decrease in the CO rate of 0.1 L/min by raising PEEP to 2.5 mbar (Fig. 5f). This decrease in CO with increasing PEEP in a curvilinear relationship has been previously reported⁴².

A reduction in TV increases CO; nevertheless, the degree of improvement in hemodynamics depends largely on ITP⁴³. Reducing the tidal volume increases chest wall compliance by decreasing ITP during MV and increasing venous return, leading to increased left ventricular preload and CO. This is consistent with our finding; our model showed an increase in CO of 0.03 L/min per 1 mL/kg of TV reduction (Fig. 6b). A tidal volume > 15 mL/kg markedly decreases HR and blood pressure and reduces CO⁴⁴. However, we could not evaluate this observation with limited training data for tidal volumes > 15 mL/kg, and a machine learning model could not make meaningful predictions.

Changes in exhaled carbon dioxide during general anesthesia with stable ventilation correspond to changes in CO and metabolic CO₂ production⁴⁵. At ETCO₂ levels > 30 mmHg, RF, GBM, and XGBoost models predict a satisfactory CO increase of 0.5 L/min per 10 mmHg of ETCO₂ (Fig. 5d). A similar correlation between ETCO₂ and CO has been reported in previous studies⁴⁶. An animal model during cardiopulmonary resuscitation showed a correlation coefficient of 0.79 between EtCO₂ and CI⁴⁷. This finding is consistent with that of the GLM model. However, the GLM model had a lower performance than that of the RF, GBM, and XGBoost models and had less training data with EtCO₂ < 30 mmHg.

A decline in SpO₂ was observed with decreasing CO in all base models in our study (Fig. 5c). Decreased CO caused by cardiopulmonary interactions is the primary factor in the reduction of arterial oxygen content observed during MV⁴⁸. Hypovolemia and vasodilation, which are commonly observed during general anesthesia, may also contribute to this phenomenon. However, our data did not allow us to determine whether the increased inspired O₂ fraction reflected an increase in CO (Fig. 5b). It is widely recognized that increases in FiO₂ at fixed values of CO fail to detect conditions of low oxygen supply during central venous oxygen saturation⁴⁹.

This study may be more compelling if the model was applied to a dataset that included direct CO measurements obtained through thermodilution using a pulmonary artery catheter. Nevertheless, the interpretability of the developed multimodal stacking ensemble is a notable strength of the proposed system. By offering valuable insights into the interpretation of the model, we deepen our understanding of all purely physiological inputs implicated in CO estimation. This not only enhances scholarly comprehension within the discipline, but also promotes the endorsement and integration of the system among healthcare practitioners. The architecture of this model aligns with the characteristics of "locked" algorithms as defined in the proposed regulatory framework for modifications to Artificial intelligence/machine learning (AI/ML)-based Software as a Medical Device by the food and drug administration (FDA)⁵⁰. Training the complex algorithm with numerical data enhanced its versatility, allowing the model to be saved, exported, and deployed in diverse medical environments for production use.

Further limitations of this study are as follows. The data analyzed was from one source only and focused solely on adult patients. During data mining, we could not find synchronized records of sudden blood loss or vasoactive infusions. This limitation has an impact on our model's ability to assess fluid responsiveness and requires thorough evaluation when our model undergoes testing in real-time general anesthesia scenarios. The perioperative clinical information dataset contained data on estimating intraoperative blood loss and cumulative intraoperative use of vasoactive medications (ephedrine, phenylephrine, and epinephrine). However, this information lacks a recorded time, making it unsuitable for inclusion in our model. Mechanical ventilation without spontaneous effort may affect hemodynamics differently; nonetheless, the ventilation modes were not documented in the VitalDB data, leading to their exclusion from this study. In addition, the small number of patients with obstructive or restrictive lung diseases made it difficult to include them in the data subset. Although constant ventilation was ensured during surgery in this study, it is important to recognize that the period from the onset of anesthesia to the start of surgery and the time between the end of surgery and extubation are important for comprehending the influence of cardiopulmonary interactions on hemodynamics. During extubation or weaning, spontaneous inspiratory efforts in patients with obstructive and restrictive lung disease may strongly decrease CO by increasing the left ventricular afterload, especially if left ventricular function is already impaired⁴³. Our model should be improved in the future to address these cardiopulmonary interactions.

Conclusion

Using a multimodal stacking ensemble algorithm, involving two-component regression based on hemodynamic and respiratory monitoring inputs, acceptable performance was achieved in comparison to data obtained by the pulse contour technique CO monitors.

This innovative methodology has the potential to discern the intrinsic physiological processes occurring in cardiopulmonary interaction during mechanical ventilation at a 14-s interval, particularly in the context of CO estimation. Based on the last recorded monitoring parameter, the model predicts only current CO for each interval of 14 s. By predicting CO cumulatively over time, we can assess the impact of cardiopulmonary interaction on CO during mechanical ventilation.

Current research has the potential to address the rising demand for non-invasive CO measurements; however, it is crucial to conduct external validation using several data sources and diverse patient conditions.

Methods

Data source

In this study, de-identified data were used from an open database of non-cardiac surgery patients who underwent routine or emergency operations at Seoul National University Hospital, Seoul, Korea, from August 2016 to June 2017⁵¹. This database contains prospectively collected intraoperative vital sign data from 6,388 general, thoracic, urological, and gynecological surgery cases, with the formal approval of an ethics committee/IRB (H-1408-101-605) and registered at www.clinicaltrials.gov (identifier: NCT02914444). Perioperative clinical information was retrospectively obtained. In addition, several anesthesia devices recorded up to 12 waveforms and 184 numeric data tracks during surgery using the Vital Recorder program.

Monitoring parameters and data structure

Hemodynamic data, obtained as numeric values at 2-s intervals, included non-invasive blood pressure (NIBP), plethysmographic HR, and SpO₂ data (Solar™ 8000 M, GE healthcare, Wauwatosa, WI, USA). In addition, CO was recorded using pulse contour technique monitors such as Vigileo and EV1000 (Edwards Lifesciences, Irvine, CA, USA).

Respiratory data collected using the anesthesia machine (Primus, Dräger, Lübeck, Germany) were recorded at 7-s intervals. MV was determined by estimating the fraction of inspired oxygen (FiO₂), expiratory TV, expiratory minute volume (Vm), positive end-expiratory pressure (PEEP), peak inspiratory pressure (PIP), respiratory rate based on capnography, and infrared spectrometry capnography, which measures EtCO₂, thereby ensuring adequate and accurate ventilation per period.

According to the official VitalDB data descriptor, invalid data tracks that were identified during the data check were excluded. Following this exclusion, the data were organized into nonconstant time intervals⁵¹.

Data extraction occurred at a rate of one second per interval, involving 16 parameters and comprising a total of 6,236,640 rows over a duration of 1732.4 h (EV1000:1288.03 h; Vigileo: 444.1 h) intraoperative monitoring with 99,786,240 datapoints. Hemodynamic monitoring data, collected at 2-s intervals, were aligned with the anesthesia machine data and recorded at 7-s intervals. Consequently, data synchronization occurred every 14 s, encompassing 538,354 rows of data. CO, marked as a target intraoperative parameter, demonstrated a 43.5% reduction in missed data, leaving 234,225 rows.

The NIBP measurements in VitalDB were recorded every 2 s. However, intraoperative NIBP in this data subset was intermittently measured over a period of 2–45 min, with interval data ranging from 1 to 10 min recorded for each measurement. The absence of NIBP during surgery led to a 77.9% reduction in data. After extracting data for adult patients aged > 18 (constituting 2.4% of the dataset) and removing the remaining 1% of missing values, a total of 49,007 pairs of row measurement data were available for subsequent analyses. To uphold the rigor of the analysis and enhance the precision of the results, rows containing missing values were excluded from the dataset.

Retrieving demographic and patient characteristics

The VitalDB records were identified using case IDs that could be matched with the case IDs in the perioperative clinical information. After data preprocessing, each record was matched to its corresponding perioperative clinical information to retrieve the demographic and patient characteristics. Subject characteristics, including age, sex, weight, height, body mass index (BMI), ASA grade, preoperative comorbidity, department, operation type, surgical approach, and postoperative ICU stay, were analyzed. In addition to describing the parameters, descriptive statistics were used to describe them in terms of minimum, maximum, mean, standard deviation, median, 25th–75th quartiles, and 95% confidence intervals (see Supplementary Tables S1–3).

Evaluating levels of general anesthesia

Both inhalational and intravenous anesthetics influence systemic vascular resistance and cardiac contractility, leading to a reduction in CO⁵². Therefore, it is imperative to delineate the depth of anesthesia while maintaining general anesthesia in this project. In instances of total intravenous anesthesia (TIVA), propofol was titrated to maintain the bispectral index (BIS) at 41.07 ± 9 and delivered through a target-controlled infusion pump. For inhalational anesthesia, sevoflurane and desflurane were adjusted using a vaporizer to target the Minimum Alveolar Concentration (MAC) at 0.87 ± 0.24 independently of the BIS value. The depth of anesthesia administered during general anesthesia for this study is delineated in Supplementary Table 5.

Model development

For multisystem CO prediction, we used multimodal stacking-based ensemble learning regression techniques (Fig. 1). We split the multimodal data into training, validation, and test sets at a ratio of 7.5:1.25:1.25.

After preprocessing, the training and validation data were used to train and validate the model using both hemodynamic and respiratory parameters (NIBP, HR, MV, SpO₂, and EtCO₂), to calculate the relationship between cardiorespiratory interactions and CO. Further training and validation data were separated into hemodynamic (NIBP and HR subsets) and respiratory parameters (MV, SpO₂, and EtCO₂ subsets). Three data subsets were constructed using the demographic variables (age, height, weight, and sex).

Four base learner models were used for the multimodal stacking ensemble in this study (Level-0): a generalized linear model (GLM)⁵³, Random Forest (RF)⁵⁴, Gradient Boosting Machine (GBM)⁵⁵, and extreme gradient boosting (XGBoost)⁵⁶. Excluding GLM, all other models were nonlinear regressions.

Within the R interface for ‘H₂O’, a scalable open-source platform, we employed a random grid search methodology to pinpoint an optimal set of hyperparameter values for maximizing the effectiveness of our models on the dataset. The H2O platform employs a random hyperparameter search with time and metric based early stopping, enabling the identification of high-quality models within a limited computational timeframe^32,57.

To optimize the hyperparameters of the GLM, distributed RF, GBM, and XGBoost regression models in the three data subsets, a random search was conducted by splitting the training set fivefold to optimize the hyperparameters and enhance the model performance by lowering the predicted value error measured through the mean absolute error (MAE). The key hyperparameters for DRF, GB, and XGBoost (ntrees, max_depth, learn_rate, sample_rate/col_sample_rate, and min_rows) were employed, whereas for GLM, alpha and lambda were utilized. Additionally, search criteria including max_models, max_runtime_secs, stopping_tolerance, and stopping_rounds were applied. The hyperparameters of the base models are summarized in Supplementary Table 4.

The best optimized regression predictions of the 12 base models from the three data subsets were subsequently used as input features for the multimodal stacking ensemble method (Level-1)¹⁵.

In the second step, the Stacked Ensemble method uses a metalearning algorithm to learn the optimal combination of the base learners⁵⁸. The metalearner is a default H2O GLM with non-negative weights. The GLM metalearner was evaluated during the implementation of a stacking regression with cross-validation, where lambda was employed as a hyperparameter.

The comparison results of the base models and multimodal stacked assembly with the performance metrics in the regression are listed in Table 1.

Performance metrics in regression

The mean square error (MSE), root mean square error (RMSE), MAE, and coefficient of determination (R²) were used as performance indicators to evaluate the regression algorithms. The MSE and RMSE are commonly used regression metrics.

$$RMSE=\sqrt{\frac{1}{N}} \sum_{i = 1}^{N}{({y}_{i}-{y}_{i}^{p})}^{2}.$$

(1)

By taking the square root of MSE, the RMSE (Eq. 1) measures the difference between the predicted CO ${y}_{i}^{p}$ and measured CO values ${y}_{i}$. The RMSE was calculated as the square root of the sum of all regression errors per row divided by the total number of observations. The regression performance improved, with lower RMSE and MSE values.

$$MAE=\frac{1}{N}\sum_{i = 1}^{N}\left[{y}_{i}-{y}_{i}^{p}\right].$$

(2)

The MAE measures the difference between the measured CO ${y}_{i}$ and predicted CO ${y}_{i}^{p}$ values divided by the total number of observations (Eq. 2). Low MAEs indicate high model accuracy.

$${R}^{2}=1-\frac{\sum_{i=1}^{N}{\left({y}_{i}-{y}_{i}^{p}\right)}^{2}}{\sum_{i=1}^{N}{\left({y}_{i}-\overline{y}\right)}^{2}}.$$

(3)

R² represents how well a regression model explains the variability between the measured and predicted CO. From 0 to 1, the higher the value, the better the model (Eq. 3). R² represents the variability explained by the model (squared difference between the target ${y}_{i}$ and the predicted value ${y}_{i}^{p}$) divided by the total deviance y value (squared difference between the target ${y}_{i}$ and the mean target values $\overline{y}$).

Model evaluation

We used the Bland–Altman method as a statistical standard to compare the measures of CO ${y}_{i}$ from both EV 1000 and Vigileo with the multimodal stacking ensemble model prediction CO ${y}_{i}^{p}$⁵⁹. The Bland–Altman plot was introduced to describe the agreement, where the y-axis shows the difference between the measured and model-predicted CO (${y}_{i}-{y}_{i}^{p}$), and the x-axis represents the average of the measured and model-predicted CO ((${y}_{i}$+${y}_{i}^{p}$)/2). In summary, the absolute mean difference between the measured and predicted CO ($\overline{d}=\frac{1}{N}\sum_{i=1}^{N}{(y}_{i}-{y}_{i}^{p}$) can be used to estimate the constant bias, and the limits of agreement (LoAs) lie between $\overline{d}-$ 1.96S_d and $\overline{d}$+1.96S_d, where S_d is the standard deviation. Percentage difference plots and analyses were used to assess the proportional differences between the measured and model-predicted CO. This shows how this error influences lower CO measurements, whereas for higher CO values, the percentage bias is decreased⁶⁰. In the proportional Bland–Altman plots, bias changes over the measuring range, and are presented as a proportional mean difference $\overline{\overline{d}}$= $\frac{1}{N}\sum_{i = 1}^{N}\frac{({y}_{i} - {y}_{i}^{p})}{({y}_{i}+{y}_{i}^{p})/2}$, and the LoA lay between $\overline{\overline{d}}-$ 1.96S_d and $\overline{\overline{d}}$+1.96S_d.

Data availability

The datasets generated and/or analysed during the current study are available in the VitalDB repository, https://physionet.org/content/vitaldb/1.0.0/. The R-based interactive web applications for this study can be accessed at the address provided below: https://albiondervishi.shinyapps.io/CO_Shiny/.

References

Karamolegkos, N., Albanese, A. & Chbat, N. W. Heart-lung interactions during mechanical ventilation: Analysis via a cardiopulmonary simulation model. IEEE Open J. Eng. Med. Biol. 2, 324–341 (2021).
Article PubMed Google Scholar
Ngo, C. A simulative model approach of cardiopulmonary interaction. IFMBE Proc. 51, 1679–1682 (2015).
Article Google Scholar
Vieillard-Baron, A. Acute cor pulmonale in acute respiratory distress syndrome submitted to protective ventilation: Incidence, clinical implications, and prognosis. Crit. Care Med. 29, 1551–1555 (2001).
Article CAS PubMed Google Scholar
Shepherd, J. T. The lungs as receptor sites for cardiovascular regulation. Circulation 63, 1–10 (1981).
Article CAS PubMed Google Scholar
Michard, F. Relation between respiratory changes in arterial pulse pressure and fluid responsiveness in septic patients with acute circulatory failure. Am. J. Respir. Crit. Care Med. 162, 134–138 (2000).
Article CAS PubMed Google Scholar
Siobal, M. S. Monitoring exhaled carbon dioxide. Respir. Care 61, 1397–1416 (2016).
Article PubMed Google Scholar
Harrison, M. J., Scott-Weekly, R. & Zacharias, M. The qualitative detection of decreases in cardiac output. Comput. Biol. Med. 58, 85–90 (2015).
Article PubMed Google Scholar
Peyton, P. J. Continuous minimally invasive peri-operative monitoring of cardiac output by pulmonary capnotracking: Comparison with thermodilution and transesophageal echocardiography. J. Clin. Monit. Comput. 26, 121–132 (2012).
Article PubMed Google Scholar
Monnet, X. Comparison of pulse contour analysis by Pulsioflex and Vigileo to measure and track changes of cardiac output in critically ill patients. Br. J. Anaesth. 114, 235–243 (2015).
Article CAS PubMed Google Scholar
Kiefer, N. Clinical validation of a new thermodilution system for the assessment of cardiac output and volumetric parameters. Crit. Care 16, 1–11 (2012).
Article Google Scholar
Slagt, C., Malagon, I. & Groeneveld, A. B. J. Systematic review of uncalibrated arterial pressure waveform analysis to determine cardiac output and stroke volume variation. Br. J. Anaesth. 112, 626–637 (2014).
Article CAS PubMed Google Scholar
D’Arrigo, S. Are peripherally inserted central catheters suitable for cardiac output assessment with transpulmonary thermodilution?. Crit. Care Med. 47, 1356–1361 (2019).
Article PubMed Google Scholar
Hill, L. B. K., Sollers, J. J. & Thayer, J. F. Evaluation of a simple estimation method for the derivation of cardiac output from arterial blood pressure and heart rate. Biomed. Sci. Instrum. 48, 165–170 (2012).
PubMed Google Scholar
Liu, N. T., Kramer, G. C., Khan, M. N., Kinsky, M. P. & Salinas, J. Blood pressure and heart rate from the arterial blood pressure waveform can reliably estimate cardiac output in a conscious sheep model of multiple hemorrhages and resuscitation using computer machine learning approaches. J. Trauma Acute Care Surg. 79, S85–S92 (2015).
Article PubMed Google Scholar
Yoon, T. & Kang, D. Multi-modal stacking ensemble for the diagnosis of cardiovascular diseases. J. Pers. Med. 13, 373 (2023).
Article PubMed PubMed Central Google Scholar
Ding, W., Wu, S. & Nugent, C. A multimodal fusion enabled ensemble approach for human activity recognition in smart homes. Health Inform. J. https://doi.org/10.1177/14604582231171927 (2023).
Article Google Scholar
Ke, L. Machine learning algorithm to predict cardiac output using arterial pressure waveform analysis. In Proc. 2022 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2022 (2022). https://doi.org/10.1109/BIBM55620.2022.9995429.
Yang, H. L. Development and validation of an arterial pressure-based cardiac output algorithm using a convolutional neural network: Retrospective study based on prospective registry data. JMIR Med. Inform. 9, e24762 (2021).
Article PubMed PubMed Central Google Scholar
Liljestrand, G. & Zander, E. Vergleichende Bestimmungen des Minutenvolumens des Herzens beim Menschen mittels der Stickoxydulmethode und durch Blutdruckmessung. Z. Gesamte Exp. Med. 59, 105–122 (1928).
Article Google Scholar
Ripley, B. Package ‘MASS’ (Version 7.3–51.4). Cran-R Proj. (2019).
Karun, K. M. & Puranik, A. BA.plot: An R function for Bland-Altman analysis. Clin. Epidemiol. Glob. Heal. 12, 100831 (2021).
Article CAS Google Scholar
Odor, P. M., Bampoe, S. & Cecconi, M. Cardiac output monitoring: Validation studies–how results should be presented. Curr. Anesthesiol. Rep. 7, 410–415 (2017).
Article PubMed PubMed Central Google Scholar
Bendjelid, K. Performance of a new pulse contour method for continuous cardiac output monitoring: Validation in critically ill patients. Br. J. Anaesth. 111, 573–579 (2013).
Article CAS PubMed Google Scholar
Slagt, C., Helmi, M., Malagon, I. & Groeneveld, A. B. J. Calibrated versus uncalibrated arterial pressure waveform analysis in monitoring cardiac output with transpulmonary thermodilution in patients with severe sepsis and septic shock: An observational study. Eur. J. Anaesthesiol. 32, 5–12 (2015).
Article PubMed Google Scholar
Seidlerová, J., Tůmová, P., Rokyta, R. & Hromadka, M. Factors influencing the accuracy of non-invasive blood pressure measurements in patients admitted for cardiogenic shock. BMC Cardiovasc. Disord. 19, 1–10 (2019).
Article Google Scholar
Lakhal, K. Blood pressure monitoring during arrhythmia: Agreement between automated brachial cuff and intra-arterial measurements. Br. J. Anaesth. 115, 540–549 (2015).
Article CAS PubMed Google Scholar
Tosti, R., Özkan, S., Schainfeld, R. M. & Eberlin, K. R. Radial artery pseudoaneurysm. J. Hand Surg. Am. 42, S44–S45 (2017).
Article Google Scholar
Khandoker, A. H., Karmakar, C. K. & Palaniswami, M. Comparison of pulse rate variability with heart rate variability during obstructive sleep apnea. Med. Eng. Phys. 33, 204–209 (2011).
Article PubMed Google Scholar
Schäfer, A. & Vagedes, J. How accurate is pulse rate variability as an estimate of heart rate variability?: A review on studies comparing photoplethysmographic technology with an electrocardiogram. Int. J. Cardiol. 166, 15–29 (2013).
Article PubMed Google Scholar
Yamaguchi, Y. The difference between set and delivered tidal volume: A lung simulation study. Med. Devices Evid. Res. 13, 205–211 (2020).
Article Google Scholar
Ruszkai, Z. & Szabó, Z. Maintaining spontaneous ventilation during surgery—A review article. J. Emerg. Crit. Care Med. 4, 5 (2020).
Article Google Scholar
LeDell, E. o.fl. Package „h2o“. April (2020).
Inglis, A., Parnell, A. & Hurley, C. B. Visualizing variable importance and variable interaction effects in machine learning models. J. Comput. Graph. Stat. 31, 766–778 (2022).
Article MathSciNet Google Scholar
Brandfonbrener, M., Landowne, M. & Shock, N. W. Changes in cardiac output with age. Circulation 12, 557–566 (1955).
Article CAS PubMed Google Scholar
Jegier, W., Sekelj, P., Auld, P. A., Simpson, R. & McGregor, M. The relation between cardiac output and body size. Br. Heart J. 25, 425–430 (1963).
Article CAS PubMed PubMed Central Google Scholar
Forton, K., Motoji, Y., Caravita, S., Faoro, V. & Naeije, R. Exercise stress echocardiography of the pulmonary circulation and right ventricular-arterial coupling in healthy adolescents. Eur. Heart J. Cardiovasc. Imaging 22, 688–694 (2021).
Article PubMed Google Scholar
Sugimoto, T., Sagawa, K. & Guyton, A. C. Effect of tachycardia on cardiac output during normal and increased venous return. Am. J. Physiol. 211, 288–292 (1966).
Article CAS PubMed Google Scholar
Dart, A. M. & Kingwell, B. A. Pulse pressure—A review of mechanisms and clinical relevance. J. Am. Coll. Cardiol. 37, 975–984 (2001).
Article CAS PubMed Google Scholar
Petrie, C. J. Low pulse pressure as a poor-man’s indicator of a low cardiac index in patients with severe cardiac dysfunction. J. Cardiovasc. Med. 15, 315–321 (2014).
Article Google Scholar
Jellinek, H., Krafft, P., Fitzgerald, R. D., Schwarz, S. & Pinsky, M. R. Right atrial pressure predicts hemodynamic response to apneic positive airway pressure. Crit. Care Med. 28, 672–678 (2000).
Article CAS PubMed Google Scholar
Kuhn, B. T., Bradley, L. A., Dempsey, T. M., Puro, A. C. & Adams, J. Y. Management of mechanical ventilation in decompensated heart failure. J. Cardiovasc. Dev. Dis. 3, 33 (2016).
PubMed PubMed Central Google Scholar
Dhainaut, J. F. Mechanisms of decreased left ventricular preload during continuous positive pressure ventilation in ARDS. Chest 90, 74–80 (1986).
Article CAS PubMed Google Scholar
Pinsky, M. R. Cardiopulmonary interactions: Physiologic basis and clinical applications. Ann. Am. Thorac. Soc. 15, S45–S48 (2018).
Article PubMed PubMed Central Google Scholar
Cheifetz, I. M. Increasing tidal volumes and pulmonary overdistention adversely affect pulmonary vascular mechanics and cardiac output in a pediatric swine model. Crit. Care Med. 26, 710–716 (1998).
Article CAS PubMed Google Scholar
Monge García, M. I. Non-invasive assessment of fluid responsiveness by changes in partial end-tidal CO₂ pressure during a passive leg-raising maneuver. Ann. Intensive Care 2, 2–9 (2012).
Article Google Scholar
Baraka, A. S. End-tidal CO₂ for prediction of cardiac output following weaning from cardiopulmonary bypass. J. Extra Corpor. Technol. 36, 255–257 (2004).
Article PubMed Google Scholar
Weil, M. H., Bisera, J., Trevino, R. P. & Rackow, E. C. Cardiac output and end-tidal carbon dioxide. Crit. Care Med. 13, 907–909 (1985).
Article CAS PubMed Google Scholar
Kelman, G. R., Nunn, J. F., Prys-roberts, C. & Greenbaum, R. The influence of cardiac output on arterial oxygenation: A theoretical study. Br. J. Anaesth. 39, 450–458 (1967).
Article CAS PubMed Google Scholar
Zampieri, F. G., Park, M., Azevedo, L. C. P., Amato, M. B. P. & Costa, E. L. V. Effects of arterial oxygen tension and cardiac output on venous saturation: A mathematical modeling approach. Clinics 67, 897–900 (2012).
Article PubMed PubMed Central Google Scholar
Administration, U. S. F. and D. Proposed regulatory framework for modifications to artificial intelligence/machine learning (AI/ML)-based software as a medical device (SaMD). U.S. Food Drug Adm. (2019).
Lee, H.-C. VitalDB, a high-fidelity multi-parameter vital signs database in surgical patients. Sci. Data 9, 279 (2022).
Article PubMed PubMed Central Google Scholar
Johnson, J. S. & Loushin, M. K. The effects of anesthetic agents on cardiac function. In Handbook of Cardiac Anatomy, Physiology and Devices 3rd edn (ed. Iaizzo, P. A.) (Springer International Publishing, 2015). https://doi.org/10.1007/978-3-319-19464-6_17.
Chapter Google Scholar
Breslow, N. E. Generalized linear models: Checking assumptions and strengthening conclusions. Transformation 8, 23–41 (1996).
Google Scholar
Geurts, P., Ernst, D. & Wehenkel, L. Extremely randomized trees. Mach. Learn. 63, 3–42 (2006).
Article Google Scholar
Friedman, J. H. Greedy function approximation: A gradient boosting machine. Ann. Stat. https://doi.org/10.1214/aos/1013203451 (2001).
Article MathSciNet Google Scholar
Chen, T. & Guestrin, C. XGBoost: A scalable tree boosting system. In Proc. of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2016).
Bergstra, J. & Bengio, Y. Random search for hyper-parameter optimization. J. Mach. Learn. Res. 13, 281–305 (2012).
MathSciNet Google Scholar
Breiman, L. Stacked regressions. Mach. Learn. 24, 49–64 (1996).
Article Google Scholar
Bland, J. M. & Altman, D. G. Measuring agreement in method comparison studies. Stat. Methods Med. Res. 8, 135–160 (1999).
Article CAS PubMed Google Scholar
Giavarina, D. Understanding Bland Altman analysis. Biochem. Med. 25, 141–151 (2015).
Article Google Scholar

Download references

Author information

Authors and Affiliations

Anaesthesiology and Intensive Care Medicine, Medius CLINIC NÜRTINGEN-Academic Teaching Hospital of the University of Tübingen, Auf dem Säer 1, 72622, Nürtingen, Germany
Albion Dervishi

Authors

Albion Dervishi
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Albion Dervishi conceptualization, design, methodology, data analytics, visualization, writing—original draft.

Corresponding author

Correspondence to Albion Dervishi.

Ethics declarations

Competing interests

The author declares no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Tables S1–S3.

Supplementary Table S4.

Supplementary Table S5.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Dervishi, A. A multimodal stacked ensemble model for cardiac output prediction utilizing cardiorespiratory interactions during general anesthesia. Sci Rep 14, 7478 (2024). https://doi.org/10.1038/s41598-024-57971-6

Download citation

Received: 27 May 2023
Accepted: 23 March 2024
Published: 29 March 2024
DOI: https://doi.org/10.1038/s41598-024-57971-6

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.