Introduction

Evaluation of circulation is determinant to deliver indicated care in patients with suspected compromised circulation. The paradox is that when it is critical it is also difficult to measure. In out-of-hospital cardiac arrest (OHCA), detecting a pulse-generating rhythm (PR) that produces the return of spontaneous circulation (ROSC)1 is key for a prompt cessation of unnecessary chest compressions that could reinduce ventricular fibrillation2 and a quick initiation of post-resuscitation care3. Nevertheless, palpating pulse in the carotid is not a reliable measurement of circulation4,5 and the evaluation of 'normal breathing' proposed by current resuscitation guidelines6 has resulted in an unreliable indicator of circulation7,8. Moreover, when visually inspecting the electrocardiogram (ECG) PR can be misinterpreted as electromechanical dissociations of heart functions, namely pulseless electrical activity (PEA) and pseudo-PEA (PPEA). Both PEA and PPEA present a (quasi)-normal ECG with visible QRS complexes. During PEA there is a total loss of the contractile function of the heart, while during PPEA the heart contracts feebly, but insufficiently for a proper organ perfusion and for a recovery of consciousness9.

Cardiac output (CO) monitoring might be the key to differentiate PEA (nonexistent CO), PPEA (low CO) and PR (normal CO). Unfortunately, there is no ideal system for prehospital CO monitoring which should be non-invasive, easy to use, advanced skills not needed, accurate, continuous, reliable and compatible in adult/pediatric patients10. To date, the thermodilution technique using pulmonary artery catheter (PAC) has been considered the gold standard in CO measurement. Nevertheless, it is invasive, not feasible in the prehospital setting, and the appropriateness of using PACs in critically ill patients has been questioned since catheters are associated with increased morbidity and mortality11,12. For these reasons, in the last two decades a gradual transition from invasive to minimally or non-invasive techniques has been witnessed13. Among the minimally invasive techniques, pulse contour analysis of the arterial pressure waveform10,14, transpulmonary thermodilution15, transesophageal echocardiography16,17 and partial CO2 rebreathing18 are the ones implemented by manufacturers into their commercial CO monitors. However, most of these methods are not feasible for use in a prehospital emergency setting, because the equipment is either technically difficult to operate or unreliable in the prehospital scenario, time-consuming to deploy, or special skills are necessary to get reliable readings.

A suitable non-invasive and low-cost alternative is the ballistocardiography (BCG) that measures the recoil forces of the body caused by the blood pumped by the beating heart into the vascular bed19. The scientific literature on the use of BCG for CO estimation is scarce. The idea of estimating CO from the BCG was conceived by Starr et al. in 194020. They proposed the first formula to compute stroke volume in subjects at rest using features (mainly durations and areas) from the BCG measured through a lightweight bed suspended by long cables. However, the extremely cumbersome hardware and the advent of echocardiography led the medical community to widely abandon the BCG21. Recently, research on BCG has reemerged thanks to the technological advancements that reduced in size the sensors. Thus, waveform features extracted from BCG sensors located on a bathroom scale22,23, on the seat of a chair24, and on body surface25 have been reported to track changes in CO. The amplitude of the BCG waveform recorded by sensors located on a force plate26 and on the wrist27 has been found to be positively correlated with CO. BCG acquired by sensors installed on home bathroom scale28, on a static-charge-sensitive bed29, and on the wrist27 has even been suggested as a surrogate measure of heart contractility. Nevertheless, none of the previous studies were conducted in a prehospital emergency setting and did not propose a method for CO estimation. Therefore, the aim of this study was to analyze the feasibility of estimating CO using morphological features extracted from BCG piezoelectric biosensors placed on the body surface of healthy subjects during test conditions that may mimic an emergency medicine situation.

Materials

Study setting, design and protocol

We conducted an observational quality controlled scenario based study of healthy volunteer subjects at Oslo University Hospital (Oslo, Norway). The Regional Ethics Committee for Medical and Health Research (REK 153368) and the Data Safety Commission approved the study which was registered in ClinicalTrials.gov (NCT04585568). All methods were carried out in accordance with relevant guidelines and regulations. All subjects received written and oral information about the procedures involved and signed consent before enrolment. The invasive procedure (fully described in the supplementary materials) and potential risks of radial artery cannulation was described explicitly and in detail, also as required and approved upon by the Ethical Committee (REK, see above). The study was coordinated by the Norwegian National Advisory Unit on Prehospital Emergency Medicine (NAKOS) and the Division of Prehospital Services, Oslo University Hospital.

The study was conducted in the cardiology department of the Oslo University Hospital (Ullevål Hospital) where each subject lied on an ambulance stretcher with (1) the defibrillation pads of LIFEPAK® 15 monitor/defibrillator (Stryker, Redmond, WA, USA) attached in anterolateral position for electrocardiogram (ECG, lead II) recording, (2) with the FloTrac sensor attached to the radial arterial line (see details of the procedure in the supplementary materials) and connected to the HemoSphere monitor (Edwards Lifesciences Corporation, Nyon, Switzerland) for recording the invasive arterial blood pressure that allows the CO calculation based on pulse wave contour analysis, and (3) two BCG piezoelectric biosensors (Kopera Norway AS, Stavanger, Norway) placed on the skin over the carotid artery and the abdominal aorta and connected via USB to a laptop (Lenovo ThinkPad Ultrabook) equipped with ad hoc software for recording and storing BCG signals (see Fig. 1 for an overview of biosensor placements).

Figure 1
figure 1

Overview of biosensor placements and study flow. (a) Subject placed on a stretcher. (b) BCG biosensor placed on the skin over the carotid artery. (c) FloTrac transducer for measuring arterial blood pressure. (d) Radial arterial line connected to the FloTrac sensor and transducer. (e) Defibrillation pads connected to the LIFEPAK® 15 monitor showing ECG (example of red ECG waves shown over blue BCG waves). (f) BCG biosensor placed on the skin over the abdominal aorta. (g) Flow of the study from left to right showing each phase with corresponding BCG waves. CCA, chest circumference at armpit level; CCX, chest circumference at the xiphoid process level; DJX, distance from the jugular notch to the xiphoid process.

The protocol consisted of six consecutive continuous phases without interruptions: Phase 1 (Normoventilation for 9 min), Phase 2 (Hyperventilation for 1 min), Phase 3 (Hypoventilation by holding the breath for 0.5 min), Phase 4 (Trendelenburg, subjects were put into a 30° Trendelenburg position for around 2 min), Phase 5 (Ambulance Tour, consisted of transportation of the subject on the stretcher three floors down in an elevator to an ambulance for a 5 min standardized ride on very bumpy roads and back to the cardiology department), and Phase 6 (Infusion, subjects immediately received a 500 ml infusion of Ringer Acetat and normoventilated for around 4 min). The graphical representation of the study protocol is shown in Fig. 1.

Monitoring and verification of the different phases were carried out using a GoPro camera attached to the stretcher. All phases’ start and endpoints were marked with annotations in addition to verbal prompts of start and stop audiotaped by the GoPro camera. After the last phase of each subject the radial arterial line was removed and pressure was applied to the insertion site for 10 min, for safety the subjects were observed by an anesthesiologist during this time. Subjects were followed up concerning pain or discomfort at the insertion site, one subject had temporary discomfort which resolved without further measures.

Data materials

The study database contained recordings from a total of 20 healthy adult subjects. Each subject recording consisted of concurrent and continuous measurements (hereinafter signals) of the ECG, carotid and abdominal BCG (from here on BCGc and BCGa) recorded with a sampling frequency of 250 Hz, and the CO signal at a sampling frequency of 1/2 Hz (a sample every 2 s). The CO signal was considered the ground truth and computed by the HemoSphere device through the pulse wave contour analysis of the invasive arterial blood pressure. Since each signal was recorded by a different device, all the signals were first converted to a standard open format in MATLAB (Matick, MA, USA) using our own codes, and then, an ad hoc created graphical user interface was used to concurrently visualize and time align the signals.

Recordings from all subjects were visually inspected to extract segments free of artifacts due to subject/sensor/pad movement or incorrect skin-pads contact. No segments were extracted from Hyperventilation and Ambulance Tour phases. During Hyperventilation phase the fast and abrupt way of breathing introduced movement artifacts in the signals, while vehicle movement during the Ambulance Tour drastically corrupted the BCG recordings and made analysis impossible.

The study dataset was composed of a total of 125 segments, 27/19/29/50 from Phases 1/3/4/6, containing the ECG, BCGc, BCGa and CO signals. The mean (standard deviation, SD) duration of all the segments was 48.6 (42.3) s, while for Phases 1, 3, 4 and 6 the duration was 41.1 (27.6) s, 30.1 (7.1) s, 45.7 (28.0) s, and 61.3 (57.6) s, respectively.

Methods

A machine learning based model to estimate the CO using morphological features extracted from BCG signals was developed in this study. The approach first preprocesses ECG, BCGc and BCGa, then adaptively filters BCGc and BCGa to extract their circulatory-related components and builds a multiple linear regression model to estimate the CO using features extracted from the circulatory-related components of BCGc and BCGa.

Signal processing and feature computation

The ECG and BCG signals were first preprocessed using band-pass filters between 0.5–30 Hz and 0.5–3 Hz, respectively, to suppress the baseline drift and high frequency noise. The instants of QRS complexes were then automatically detected in the preprocessed ECG using the Hamilton-Tompkins algorithm30. Afterwards, BCGc and BCGa signals were adaptively filtered using a recursive least square (RLS) algorithm31,32. The RLS algorithm used the instants of QRS complexes as reference for tracking and extracting the circulatory component from BCGc (CCc) and BCGa (CCa) signals, respectively. Figure 2 shows the preprocessed (left) and adaptively filtered (right) signals in a 15-s segment from Phase 3 (hypoventilation). Both CCc and CCa reflect a fluctuation after each heartbeat. A detailed technical description of the method for estimating the circulatory component is given in the supplementary materials.

Figure 2
figure 2

Adaptive filtering of BCG signals to extract the circulatory-related component. Preprocessed ECG, BCGc and BCGa are represented from top to bottom in the left panel. While preprocessed ECG and circulatory-related components of the BCGc (CCc) and BCGa (CCa) are depicted from top to bottom in the right panel. Dark blue circles represent the instants of the QRS complexes.

Segments were assessed using 10-s analysis windows with an 80% overlap between consecutive windows as shown in Fig. 3. A total of 66 waveform features were computed to characterize each 10-s analysis window of CCc and CCa. Specifically, 33 features were calculated from each circulatory component. The first three features were intended to describe the statistical distribution of the circulatory component and were the standard deviation, skewness and kurtosis, respectively. The last 30 features corresponded to the median and standard deviation of 15 morphological characteristics computed on a beat-to-beat basis and with the aim of characterizing the amplitude (5), duration (4), area (3) and length (3) of the fluctuations in the circulatory component. Concretely, amplitude-related features corresponded to the peak amplitude, onset-to-peak, offset-to-peak, and the maximum and the mean between onset-to-peak and offset-to-peak amplitudes. The four duration-related features were the onset-to-peak, peak-to-offset and total durations, and the pulse width, i.e. the duration of the upper half of the fluctuation. The three features characterizing the area of the fluctuation were the onset-to-peak, peak-to-offset and total area, while length of the fluctuation was measured by computing the curve length from onset to peak, from peakt to offset, and from onset to offset (total length). The complete description of the morphological features can be found in the supplementary materials.

Figure 3
figure 3

Example of a 60-s segment analyzed corresponding to Phase 4 (Trendelenburg). From top to bottom, the CO, preprocessed ECG, CCc and CCa are depicted. Dark blue filled dots in CCc and CCa represent the maxima of the fluctuations caused by each heartbeat. The large shaded rectangles in the axes illustrate the first window analyzed, whereas smaller rectangles at the bottom represent the relative position of consecutive analysis windows.

Framework for the CO estimation model

Segments were randomly and patient-wise split into development (75%) and validation (25%) sets. Thus, the development/validation sets contained all the segments corresponding to the 15/5 patients randomly allocated to each set, respectively. The development set was used to select the best feature subset and to build the model based on the selected features, while the validation set was used to evaluate the performance of the model. Segments were assessed using 10-s analysis windows with an 80% overlap between consecutive windows as shown in Fig. 3. That is, for every 10-s window analyzed the median values of the 66 features were calculated and fed into the model to estimate the median CO value within the window. Thus, a total of 2471 analysis windows were divided 1939/532 into development/validation sets respectively.

Feature selection was carried out in the development set through a fivefold cross-validation (CV) scheme using a wrapping approach33. The baseline model was a linear regressor based on the total duration of the fluctuation in CCc since this feature was the most relevant one according to the minimum redundancy maximum relevance algorithm34. Additional features were then added, one at a time, to the baseline model using the Plus-l Take-Away-r, PTA(l,r), algorithm35 with l = 3 and r = 2. Briefly, PTA(3,2) is a hybrid feature selection algorithm that at each iteration adds the most relevant feature to the K-feature model by stepwise including the best 3 features (a total of K + 3) and right after, removing stepwise the least relevant 2 features obtaining the best (K + 1)-feature model. The criterion to include/exclude a feature was the minimization of the mean squared error (MSE, see Eq. 2) between the CO value estimated by the model and the CO value provided by the Hemosphere (ground truth) across the fivefold composing the CV scheme. Right after selecting a feature, its squared transformation was also included in the model as long as it resulted in an MSE decrease. Features were added until no MSE decrease was obtained and therefore, the best K-feature subset was selected. Once features were selected, the whole development set was used to build a multiple linear regression model based on the selected best K-feature subset.

Finally, the validation set was used to evaluate the performance of the model in terms of (1) Bland–Altman plots and their corresponding limits for the 95% level of agreement (LOA95%) and (2) performance metrics such as mean absolute error (MAE), MSE and percentage of error (PE, as proposed by Critchley et al.30) defined by the following equations:

$${\text{MAE}} = \frac{1}{n}\mathop \sum \limits_{i = 1}^{n} \left| {y_{i} - \widehat{{y_{i} }}} \right|$$
(1)
$${\text{MSE}} = \frac{1}{n}\mathop \sum \limits_{i = 1}^{n} \left( {y_{i} - \widehat{{y_{i} }}} \right)^{2}$$
(2)
$${\text{PE}} = 100\frac{1.96 \sigma }{{\overline{y}}}$$
(3)

where n represents the number of analysis windows in the dataset, \({y}_{i}\) and \(\widehat{{y}_{i}}\) are the ground truth and the model estimated CO for the ith analysis window respectively, \(\sigma\) represents the SD of the error [real CO—estimated CO] and \(\overline{y }\) depicts the mean real CO.

Statistical analysis

Quantitative variables were expressed as mean (SD)/median (interquartile range) if they did/did not pass the Lilliefors’ normality test. Qualitative variables were expressed as absolute value and percentage, N (%). For comparing normal/not-normal quantitative variables by gender, the t-test/Mann–Whitney U test were performed to test for equal means/medians, respectively. A p value < 0.05 was considered statistically significant.

Results

Table 1 summarizes the baseline patient characteristics of the 20 healthy subjects enrolled in the study. Subjects were predominantly female (70%) with statistically significant differences in height and chest circumference at the xiphoid process level (CCX).

Table 1 Baseline patient characteristics stratified by gender.

Figure 4 represents the MSE obtained by a multiple linear regression model based on K-features in the fivefold CV scheme used in the development set. In other words, it shows the MSE of the model as an increasing number of features are included. There is a sharp decrease in MSE till four features are included. Then, there are slight and consecutive MSE decreases until the minimum is reached by the model containing nine features (K = 9).

Figure 4
figure 4

The mean squared error (MSE) in the development set as a function of the number of features included in the model.

Thus, the final regression model including the selected K = 9 features (details in Table 1 of the supplementary materials) was trained using the whole development set and used to produce CO estimations, \(\widehat{y}\), as described in the following equation:

$$\begin{aligned} \hat{y} & = \beta_{0} + \beta_{1} v_{1} + \beta_{1}^{\prime } v_{1}^{2} + \beta_{2} v_{2} + \beta_{3} v_{3} + \beta_{3}^{\prime } v_{3}^{2} + \beta_{4} v_{4} + \beta_{4}^{\prime } v_{4}^{2} + \beta_{5} v_{5} + \beta_{5}^{\prime } v_{5}^{2} \\ & = 6.70 - 3.42v_{1} + 2.47v_{1}^{2} - 0.31v_{2} + 0.19v_{3} + 0.05v_{3}^{2} - 1.11v_{4} + 1.00v_{4}^{2} - 0.01v_{5} + 0.24v_{5}^{2} \\ \end{aligned}$$
(4)

where \({\beta }_{j}\) and \({\beta }_{j}{\prime}\) are the regression coefficients and \({v}_{1}-{v}_{5}\) represent the standardized (to mean/SD equal to zero/one) version of the following features: the total duration of the fluctuation in CCc (\({v}_{1}\)), curve length from peak to offset of the fluctuation in CCc (\({v}_{2}\)), skewness of CCa (\({v}_{3}\)), onset to peak duration of the fluctuation in CCa (\({v}_{4}\)), and curve length from peak to offset of the fluctuation in CCa (\({v}_{5}\)) for the ith analysis window. The large magnitudes of the regression coefficients for duration related features (\({v}_{1},{v}_{4}\)) in both CCc and CCa reflect, among all features, the strongest association with the CO (p < 0.001). This fact is also corroborated by the existing strong negative linear correlation between the features and CO that showed a Pearson correlation coefficient of − 0.68 and − 0.38 for \({v}_{1}\) and \({v}_{4}\), respectively. We hypothesized that greater fluctuation durations might be a result of lower heart rate and consequently, lower CO.

Figure 5 shows the Bland–Altman plots for development (left panel, \({R}^{2}=0.39\)) and validation (right panel, \({R}^{2}=0.67\)) sets. The LOA95% ranged from − 2.01 to 2.01 and from − 2.18 to 1.89 for development and validation sets, respectively. The model (Eq. 4) showed a MAE, MSE and PE of 0.80/0.83 L/min, 1.06/1.10 L2/min2 and 30.1%/30.2% for development/validation sets, respectively. Bland–Altman plots showed how our model slightly overestimated for CO values below 4.5 L/min and substantially underestimated for CO values above 9 L/min. For normal CO values between 4.5 and 8 L/min the error was more restrained between [− 1, 1] range. We hypothesized that these errors might be a byproduct of the optimization process of the model. The dataset of the study contained 20 healthy subjects with normal CO so that the model was trained to expect and estimate normal values (around 6.7 L/min, i,e, the intercept, \({\beta }_{0}\), of the model). Thus, when very low CO and high CO values were to be estimated the model produced estimations by subtracting and adding respectively from its baseline CO (\({\beta }_{0}=6.7\) L/min), which resulted in an overestimation and underestimation.

Figure 5
figure 5

Bland–Altman plots representing the error defined as [real CO—estimated CO] as a function of the ground truth (CO computed by the HemoSphere, HS) for development (left) and validation (right) sets, respectively. Dashed line represents the mean error and dash-dotted lines represent the LOA95%.

Figure 6 illustrates the evolution of the absolute error across the different phases of the study protocol for development (left panel) and validation (right panel) sets, respectively. Similar trends can be found for both sets, although more emphasized for the development set. There is a substantial error increase from Normoventilation to Hypoventilation phase. This increase might be due to the delayed effect of the Hyperventilation phase inbetween both phases. The movement artefacts introduced by the fast and abrupt way of breathing in the Hyperventilation phase negatively affected most of the CO measurements provided by the HemoSphere monitor in the Hypoventilation phase (total duration of around 0.5 min) as the device computes CO every 2 s based on the last 20 s invasive arterial blood pressure. During Trendelenburg and Infusion phases the error decreased and stabilized. The abnormally high error presented in Normoventilation phase for the validation set compared with that obtained for the same phase in the development set is caused by the reduced number of subjects (a total of five) in the validation set and the fact that subject number one is included among those subjects. During the first phase of the first subject, i.e. the very start of the study, the control, interpretation, interiorization and implementation of the study protocol by clinicians and subjects was not optimum and resulted in lower quality of the recordings that increased the error in Normoventilation phase.

Figure 6
figure 6

Boxplots showing the absolute error distribution across phases for development (left) and validation (right) sets, respectively.

Discussion

This is the first study comparing invasive cardiac output technology with non-invasive BCG technology in realistic scenarios of emergency medicine. Our multiple linear regression model showed promising performance estimating the CO. The PE reported was only slightly above the maximum 30% PE limit proposed by Critchley et al.36 for considering CO measurements interchangeable, while those PE reported by other minimally invasive or non-invasive methods were on average well above that 30% limit regardless of the ground truth used37. When compared against the most widely accepted gold standard (thermodilution using PAC), the pulse contour analysis based technologies and the esophageal Doppler yielded a mean (SD) PE of 41.3 (2.7)% and 42.1 (9.9)% respectively in a systematic review conducted by Peyton et al.38. According to the review conducted by Joosten et al.39 the bioimpedance, CO2-rebreathing, volume clamp method and pulse wave transit time showed a weighted mean PE of 42%, 40%, 56.3% and 62%, respectively. Similarly, when compared against a different reference method, namely pulse contour analysis, a weighted mean PE of 52.9%, 39.5%, 39.2% and 44.9% were reported for bioimpedance, biorereactance, volume clamp method and esophageal echoDoppler respectively39,40,41. Nevertheless, in clinical practice PE is only one metric used to evaluate methods for CO estimation, the ability to track changes in CO during challenging conditions might be as valuable in a hemodynamic unstable patient, or during cardiac arrest, as the ability to give accurate single measurements under stable conditions38.

The present study documents, to our knowledge, a promising first step to an accurate and reliable estimation of CO using signals that can be easily recorded in the prehospital setting. BCG biosensors are (1) very easy and fast to deploy, (2) recording and processing of BCG signals might be done either in a standalone device or even integrated into commercial monitor/defibrillators with appropriate hardware and software modifications, and (3) CO estimation would be fully automated avoiding technically difficult to operate equipment and the need for well-trained personnel to get reliable CO readings. Regarding precision, our model showed a MAE of 0.83 L/min in the validation set which represents a relative error around 12% (the mean CO of 6.7 L/min). We hypothesize that the precision of the method may be improved in future studies by first adding uncorrelated information extracted from transthoracic impedance, capnography and photoplethysmography signals recorded by the monitor/defibrillators in the prehospital setting and second using more complex, and hence less interpretable, machine learning models such as ensemble of decision trees or artificial neural networks.

This study presented some limitations. First, this was an observational quality controlled scenario based study of 20 healthy subjects. Future studies should therefore consider a larger sample size including emergency medicine patients with compromised blood flow changes to corroborate these findings. We decided not to include measurements during Hyperventilation phase. The movements required by the body to allow for following the rapid and abrupt way of breathing introduced artifacts that distorted the isolated circulatory-component and made it difficult to reliably compute the morphological features. Finally, the gold standard used was the CO measured via pulse wave contour analysis of the invasive arterial blood pressure. As stated before, the PE showed by pulse contour analysis based technologies indicated no interchangeable measures with those provided by the reference method (thermodilution). Nevertheless, pulse contour analysis has been reported to reliably track changes in CO which is key to optimize the treatment of emergency medicine patients by healthcare personnel. Ideally, thermodilution should be used. However, it is an invasive technique requiring special skills and apparatus not always achievable in the prehospital setting with the patient on an ambulance stretcher. Thus, pulse contour analysis is, to the best of our knowledge, the most suitable minimally invasive alternative to be considered as gold standard in the prehospital setting.

Conclusion

This study presented a method to estimate the CO based only on morphological features extracted from carotid and abdominal BCG biosensors in healthy subjects. The PE and MAE showed by the method evidenced the potential of the BCG to help reliably estimate and track the CO. This method may be considered as the first step towards an automated, non-invasive, easy-to-deploy, cheap and reliable CO estimator, especially useful in prehospital emergency medicine to guide therapy, optimize treatment, and ultimately, contribute to increase survival rate.