Introduction

Heart rate variability (HRV), which is defined as changes in heart rate over time1, is an indicator that is used to evaluate the activity of the cardiac autonomic nervous system (ANS) because of its relationship with the parasympathetic and sympathetic activity directed into the sinus node in the heart, which controls the heart rate2,3. Several authors have found that HRV is associated with cardiovascular conditions, such as myocardial infarction and heart failure4, coronary artery disease, and sudden death5. As explained in6, HRV has also been found to aid in the diagnosis and increase the prognostic value of predisposing conditions for critical illness, such as hypertension, and some HRV parameters have been found to be abnormal even in the early stages of hypertension.

HRV is measured using electrocardiographic signals (ECG), which represent the electrical activity generated by the heart conduction system3,7, and standards of measurement have been established to align the methodologies used in HRV studies, in order to allow for comparisons among results8. Nonetheless, in recent years, several studies have reported obtaining information similar to HRV from other signals that also contain information related to the cardiac cycle, such as pulse waves. One technique that has attracted significant attention for detecting pulse-wave-related HRV, also known as pulse rate variability (PRV), is photoplethysmography (PPG)9, which is a noninvasive, simple, and inexpensive technique that utilizes optical principles to obtain the pulse wave from the microcirculation in peripheral tissue10,11. Various studies have investigated PRV changes under different conditions, such as in the presence of mental or somatic diseases or during sleep, and used them to evaluate the effects of pharmacological drugs on ANS responses8. There has also been a special interest in the study of PRV under cardiovascular conditions, such as diabetes, hypo- or hypertension, or cardiac arrhythmias1.

Although some studies have shown that PRV is a promising technique for identifying several conditions and that PRV is highly correlated with HRV, these results have mainly been observed in healthy or resting subjects in the supine position1. On the other hand, some studies have argued that PRV is not necessarily a good surrogate for HRV, mainly because of the errors made when performing the processing and acquisition methods and physiological factors, such as changes in pulse transit time (PTT)1,12. As explained in refs. 13 and 14, PTT plays an important role in the differences that are seen between PRV and HRV.

PTT is the time it takes for the pulse wave to travel from the heart to the peripheral tissue where it is being measured, and it has been shown to be related to blood pressure (BP)15. BP refers to the force that the heart uses to pump blood through the circulatory system and is one of the main measurements used to understand the behavior of the cardiovascular system16. Its associated abnormalities, especially hypertension (i.e., high BP), are associated with fatal cardiovascular diseases17.

Because of the relationship between PTT and BP and the effects of PTT on PRV measurements13,14, the aim of this study was to evaluate the relationship between HRV and PRV measured from ECG and PPG signals, respectively, obtained from critically ill patients with hypotension, hypertension, or normotension. It was hypothesized that HRV and PRV would not exhibit the same behavior and that their relationship would be affected by the BP state. To evaluate these hypotheses, signals obtained from the public MIMIC-III database from Physionet were analyzed, and PRV and HRV indices were extracted and compared to assess the relationship between HRV and PRV.

Results

Signal selection and segmentation

Data used in this study was obtained from the freely available MIMIC-III Waveform18,19. The dataset was filtered according to the length of the recordings and the quality of the ABP signals for each patient. In total, 230 records with poor-quality (PQ) ABP signals and a short duration were discarded. The signals from the remaining 270 records were used in the subsequent analysis. All signals were segmented in 5-min length segments with 10 s overlap between consecutive segments. In total, 4937 5-min segments were extracted, of which 54% were labeled as hypertensive, 25% were labeled as hypotensive, and the remaining 22% were labeled as normotensive events, according to the ABP signals.

Extraction of indices

Time- and frequency-domain and nonlinear indices were obtained from HRV and PRV signals (Table 1). Table 2 summarizes the behavior of these indices when measured during hypotension, normotension, and hypertension.

Table 1 Indices extracted from pulse rate variability (PRV) and heart rate variability (HRV).
Table 2 Mean ± standard deviation of indices measured from pulse rate variability (PRV) and heart rate variability (HRV) under each blood pressure state.
Table 3 Friedman rank sum tests results for the comparison between pulse rate variability and heart rate variability in the different blood pressure states. ‡: p value less than 5.00 × 10−2; †: p value less than 5.00 × 10−3; : p value less than 5.00 × 10−4.

Correlation between PRV and HRV indices

The correlation between indices measured from PRV and HRV was assessed and the obtained results are summarized in Fig. 1. Most of the indices showed a good correlation between HRV and PRV during all three BP states, although some of the indices tended to show a lower correlation during normotension. Interestingly, the entropy- and phase-derived indices had lower correlations. In addition, lower correlation coefficients were observed for indices associated with short-term changes, such as RMSSD, SD1, HF, and A1.

Fig. 1: Linear relationship between pulse rate variability and heart rate variability.
figure 1

Spearman correlation coefficients (ρ) were measured between a time-domain indices, b absolute and entropy indices from the frequency domain, c relative indices from the frequency domain, d centroid-related indices from the frequency domain, e Poincaré plot indices, f entropy-related indices, g phase-related indices, and h indices resulting from the detrended fluctuation analysis. All indices were obtained from pulse rate variability and heart rate variability in each blood pressure state (hypotension, normotension, and hypertension).

Comparison between HRV and PRV using the Friedman rank sum test

Because most of the data obtained from the different indices were non-normally distributed and did not comply with the assumption of homogeneity of variances, Friedman rank sum tests were used as a nonparametric alternative to repeated measures ANOVA to compare the indices measured from HRV and PRV under different BP states. The results from these tests for the comparison between HRV and PRV are shown in Table 3. For most indices, there were significant differences (p value < 0.001) between HRV and PRV, regardless of the BP state. Only VLF and cTPy showed nonsignificant differences between the two measurement sources during normotension.

Table 4 Kruskal–Wallis and post hoc multiple comparisons p values for the comparison among blood pressure states from indices measured from pulse rate variability. ‡: p value less than 5.00 × 10−2; †: p value less than 5.00 × 10−3; : p value less than 5.00 × 10−4.

Comparison between BP states using Kruskal–Wallis tests

Kruskal–Wallis tests, a nonparametric alternative to one-way ANOVA, were used to compare indices among BP states when measured using PRV or HRV. Pairwise Wilcoxon tests were used as post hoc analyses when the Kruskal–Wallis results indicated significant differences. The results are shown in Tables 4 and 5, and there were statistically significant differences among BP states from all indices, except for cHFx measured using HRV. The pairwise comparisons revealed that most of the indices showed differences among the three stages, especially when measured using PRV.

Table 5 Kruskal–Wallis and post hoc multiple comparisons p values for the comparison among blood pressure states from indices measured from heart rate variability. ‡: p value less than 5.00 × 10−2; †: p value less than 5.00 × 10−3; : p value less than 5.00 × 10−4.
Table 6 Ratio of agreement (BAR, %) derived from Bland–Altman analysis as suggested in refs. 1,6. Good agreement (): BAR < 10%; moderate agreeement ( ↔ ): 10% ≤ BAR < 20%; insufficient agreement (): BAR ≥ 20%.

Bland–Altman analysis to assess agreement

Because neither correlation analyses nor ANOVA could be used to evaluate the agreement between HRV and PRV measurements, Bland–Altman analyses, as proposed in20, were performed for each extracted index. Bias and LoAs were measured, and the results are summarized in Figs. 2 and3.

Fig. 2: Bias measured from the Bland–Altman analysis comparing indices extracted from pulse rate variability and heart rate variability.
figure 2

The bias was obtained between measured indices when comparing pulse rate variability and heart rate variability in each blood pressure state (hypotension, normotension, and hypertension) using the Bland–Altman analysis. Indices analysed were a time-domain indices, b absolute and entropy indices from the frequency domain, c relative indices from the frequency domain, d centroid-related indices from the frequency domain, e Poincaré plot indices, f entropy-related indices, g phase-related indices, and h indices resulting from the detrended fluctuation analysis.

Fig. 3: Differences between upper (LoAU) and lower (LoAL) limits of agreement obtained from Bland–Altman analysis comparing indices measured from pulse rate variability and heart rate variability.
figure 3

The limits of agreement were obtained from the Bland–Altman analysis performed with the extracted indices in each blood pressure state (hypotension, normotension, and hypertension). Indices analysed were a time-domain indices, b absolute and entropy indices from the frequency domain, c relative indices from the frequency domain, d centroid-related indices from the frequency domain, e Poincaré plot indices, f entropy-related indices, g phase-related indices, and h indices resulting from the detrended fluctuation analysis.

As shown in Fig. 2, most of the indices were overestimated when measured from PRV. Some others were underestimated, such as nLF, nHF, LF/HF, SSE, D2, A1, and A2. Indices associated with short-term changes were especially overestimated when measured from PRV. Although a general conclusion is difficult to be achieved, for most of the indices the bias differed according to the BP state. Interestingly, most indices showed a larger absolute bias during normotension. A similar trend was observed in the differences between the upper and lower LoAs, with large differences especially in indices associated with short-term changes. The largest differences were observed for D2, SampEn, and most of the frequency-domain indices. NN50 and pNN50 showed a bias and difference between LoAs of zero, which shows very good agreement between HRV and PRV.

The BAR results are shown in Table 6. For NN50 and pNN50, the bias and difference between LoAs were equal to zero for all conditions, and thus, the ratio was not measured because the agreement was total. The agreement tended to remain as good, moderate, or insufficient regardless of the BP state. Most of the indices that showed insufficient agreement are associated with short-term changes.

Discussion

HRV has been proposed as a useful, noninvasive, indirect measurement of the cardiac ANS. It has been used for several decades as an indicator of parasympathetic and sympathetic activity3, and it has been studied as a biomarker for a broad range of diseases. However, it has been found that the measurement of HRV in real-life scenarios can be impaired by several conditions, especially because of the cumbersome instrumentation needed for the acquisition of the ECG signals, which has, to some extent, precluded the usefulness and acceptance of HRV as a tool for clinicians to diagnose and monitor diseases, and for larger public health applications6. Hence, several researchers have started to investigate the possibility of replacing HRV information with a very similar signal, PRV, which is based on pulse waves that are easier to obtain and more ubiquitous, such as PPG signals9. Nonetheless, the promise of PRV as a valid surrogate for HRV has been questioned, and some studies have concluded that, although they are very similar, PRV and HRV are not exactly the same and that PRV may not be a suitable surrogate for HRV, especially when measured in disease states and in older subjects1,12.

Various explanations for the differences between HRV and PRV have been given. Some authors argue that the differences are mainly due to processing issues, such as the identification of fiducial points from the PPG signal21,22, the sampling rate used for the acquisition of the signals23,24,25, and the processing techniques used for the analysis of PRV26. However, other authors have suggested that, although these factors may affect PRV, physiological issues may have a more profound effect on the differences between these two signals12,27. The relationship between HRV and PRV may be affected by not only PTT but also other factors, such as external forces on the arterial vessels28, the presence of pathologies, including cardiovascular disorders28,29,30, and the body location at which PRV is being measured8,31. An important contributor to these differences is respiratory activity, which affects vasoconstriction and modulates aortic and left-ventricular pressure, altering the time of opening of the aortic valve during the cardiac cycle32. Nonetheless, several processes take part in the information transmission from the pure electrical ECG and the R waves to the mechanical PPG pulse wave, as is explained in ref. 33. These factors may also explain in part the differences observed between HRV and PRV, especially under non-resting conditions and in non-healthy, older subjects. Hence, PRV should not be considered a surrogate of HRV, but should be treated as an independent biomarker instead, which may contain additional information not available in HRV33.

Thus, because of the differences observed in previous studies between HRV and PRV, the aim of this study was to assess the relationship between these two signals in critically ill subjects hospitalized in intensive care units. Because these subjects exhibited changes in BP, PRV, and HRV were also compared when the subjects experienced normotension or hyper- and hypotensive events.

A first analysis was performed comparing PRV and HRV indices using linear correlation. Among the time-domain indices, an almost perfect correlation was observed between HRV and PRV for AVNN, regardless of BP state. For SDNN, NN50, and pNN50, higher correlations were observed as BP increased, but pNN50 had the lowest correlation of the time-domain indices in all BP states. RMSSD, which reflects short-term changes in HRV and PRV2, had a lower correlation, with a correlation coefficient below 0.8, during normotension.

The frequency-domain indices exhibited different trends. Absolute indices (VLF, LF, HF, and TP) and the ratio between LF and HF (LF/HF) had stable and high correlations. For normalized indices, on the other hand, there were differences according to the BP state, with better correlations during normotension. Spectral entropy (SpEn) exhibited a behavior similar to that observed for RMSSD. Indices related to the centroid of the frequency bands in the y-coordinate were relatively stable, while the x-coordinate of the centroids featured the worst correlations, especially for the x-coordinate of the HF band centroid.

The correlations for the nonlinear indices were, in general, worse than those for the time- and frequency-domain indices. This was especially true for the phase-related indices, the correlation dimension and the Lyapunov exponent. Among the Poincaré plot indices, SD2 had a nearly perfect correlation between HRV and PRV, while SD1/SD2 and SD1 exhibited behaviors similar to that of RMSSD. The correlations for the entropy-related indices were also relatively poor. The correlation for the BSE values was strongly affected by normotension, and although SampEn, ApEn, and SSE were stable regardless of the BP state, their correlation coefficients were low. Finally, A1 and A2 from the detrended fluctuation analysis behaved similarly to SD1 and SD2, respectively, probably because of the differences between the short- and long-term changes in HRV and PRV.

PRV and HRV were also compared using a Friedman rank sum test, as a nonparametric alternative of a repeated-measured ANOVA. The results indicated that there were differences between PRV and HRV in all BP states for all indices, except for the measurements of VLF and cTPy, which showed nonsignificant differences between HRV and PRV during normotension. These two indices need to be considered with care because they are probably a reflection of long-term changes, especially VLF, and require recordings longer than 5 min. In general, these results indicate that PRV and HRV are not the same, regardless of the BP state.

In addition, it was also determined if there were individual differences in PRV and HRV among BP states. The Kruskal–Wallis test results revealed that, in general, both HRV and PRV were different among hypotension, normotension, and hypertension states. The only index for which there was not a statistically significant difference was cHFx when measured from HRV. Based on the post hoc comparisons, it was concluded that PRV showed more differences than HRV. Again, as was observed for the correlations, these differences were especially observed in the nonlinear indices and in indices reflecting short-term changes, such as RMSSD, SD1, SpEn, and A1. Interestingly, most of the differences were observed when normotension was compared to either of the two other BP states.

Finally, agreement between HRV and PRV was assessed using Bland–Altman analysis and three measurements were obtained from these to evaluate the agreement between HRV and PRV: the bias, difference between limits of agreement, and BAR. For the time-domain indices, SDNN and RMSSD were overestimated when using PRV, whereas AVNN, NN50, and pNN50 had a bias close to zero. The absolute-power frequency-domain indices were also overestimated when using PRV, especially HF, LF/HF, and TP. On the contrary, the relative power indices were usually underestimated. The y-coordinate of the centroid of the HF band was also largely overestimated when using PRV. This same trend was observed for all Poincaré-plot indices, as well as for SampEn and ApEn, whereas SSE, D2, and both A1 and A2 were underestimated. The degrees of over- and underestimation tended to be larger during normotension. This same trend was observed for the limits of agreement: larger differences were observed during normotension and when some short-term indices, such as RMSSD, HF, cHFy, SD1, and A1, were measured.

The obtained BARs indicated good agreement for AVNN, VLF, LF, HF, TP cHFx, cLFy, cHFy, cTPy, SpEn, SD2, BSE, SSE, MSE, LYA, and A2. Insufficient agreement was observed for SDNN, RMSSD, nLF, nHF, LF/HF, cLFx, cTPx, S, SD1, SD1/SD2, COM, ApEn, SampEn, D2, and A1. Most short-term indices showed an extremely large BAR, which indicates a very poor agreement between HRV and PRV for the measurement of these indices in critically ill patients regardless of the BP state. There was no indication that BP changes caused significant changes in the agreement.

In conclusion, the obtained results indicate that PRV and HRV were not the same regardless of the BP state of the subjects, especially when nonlinear indices and indices associated with short-term changes were analyzed, which agrees with the results of previous studies8,14,30,34. Interestingly, the differences tended to be larger during normotension. However, although they are not the same and PRV tends to over- or underestimate HRV, both signals behave similarly in most cases. Nonetheless, the Kruskal–Wallis results indicate that PRV seems to be more sensitive to changes in BP. This could be considered as an indication that PRV contains additional information not available in HRV, which might help increase the applicability of the technique in clinical scenarios. Moreover, the widespread use of PPG in wearable devices is generating a lot of research with PRV, which is aiming to apply this more practical technique, in comparison with HRV, for the diagnosis and monitoring of several physiological phenomena related to disease (i.e., cardiovascular disease, mental health). This, in turn, shows the applicability and potential of PRV for public health studies and, hence, for screening subjects that may need later further analyses in the clinical setting, with more specialized tools.

Future studies are needed to clarify the origin of the differences between HRV and PRV and to evaluate the capability of PRV to identify BP states, which aid in the noninvasive, continuous measurement of BP. Moreover, it is critical that measurement and analysis guidelines and standards are adopted for PRV studies. This would enhance the quality of the research in this field, allowing the comparability among results obtained in different studies, and possibly increasing the applicability of PRV in clinical settings.

This study has several limitations. First, the signals used were obtained from an available database from Physionet. Thus, several variables were not controlled for, and although all subjects were hospitalized in an intensive care unit, their diagnosis was unknown, which may have affected the results. Another limitation involves the segmentation of the data into 5-min-long segments, which may have been too short for the extraction of some indices, especially frequency-domain indices. However, this length was considered necessary to obtain as many segments as possible during each BP state and still ensure sufficient data for the PRV and HRV analysis. Moreover, an overlap of 10 s was used to separate the segments, which might have been too short to reflect BP changes. Again, this was done to produce a larger database. Another limitation of the study involves the classification of segments in each BP state, specifically determining exactly which state was predominant in each segment, especially in subjects who exhibited two or more BP states during the entire recording. A larger number of available segments might have helped to mitigate this effect, and outliers for each PRV and HRV index were corrected. It is important to mention that MIMIC-III database lacks synchronicity35, which could bias the results obtained in this study. Finally, it is also worth noting that some of the extracted indices were not optimized, especially the nonlinear indices such as Poincaré-plot indices, BSE, SSE, phase indices, and DFA-related indices. Using an optimization procedure for these indices might lead to different results than those obtained in this study, as can be seen in ref. 30.

Methods

Signal selection

A subset of 500 records was obtained from the MIMIC-III Waveform Database18,19. Each record in the subset contained the ECG, PPG, and invasive arterial BP (ABP) signals, which were obtained at a 125 Hz sampling rate, from critically ill subjects in adult intensive care units. As the MIMIC-III is a publicly available database, ethical approval was not required for this study.

These records were filtered to reject PQ signals and signals with length of less than 5 min. First, signals with a duration of less than 5 min were discarded. Then, a signal quality index (SQI) algorithm was employed to detect good- and poor-quality ABP signals, as shown in Fig. 4. In the signal quality assessment algorithm, the onsets from each ABP signal were detected by applying the algorithm described in ref. 36, and the cardiac cycles were obtained. Then, the quality of each cardiac cycle was assessed using SQIs proposed in the literature37,38,39. A K-means clustering algorithm was employed to automatically group good-quality (GQ) and poor-quality (PQ) cardiac cycles in two clusters, with the SQIs used as features. Because it was expected that most of the cycles would be of good quality, the larger cluster was considered the GQ cluster. Then, the ratio (RGQ) between the number of cycles grouped as “good-quality cycles” and the total number of cycles was obtained as in (Eq. 1). The records with a RGQ greater than or equal to 80% were considered GQ signals, and the remaining records were discarded.

$${R}_{{\mathrm{GQ}}}=(100 \% )\frac{{n}_{{\mathrm{GQ}}}}{{n}_{{\mathrm{GQ}}}+{n}_{{\mathrm{PQ}}}}$$
(1)
Fig. 4: Signal quality assessment algorithm.
figure 4

This algorithm was applied for discarding low-quality arterial blood pressure signals.

Signal processing

MATLAB® (version 2020a) was used for signal processing. ECG, PPG, and ABP signals were segmented into 5-min-long segments, with an overlap of 10 s between consecutive segments.

After segmentation, 5-min-long ABP signals were filtered using a 12 Hz, fourth-order, lowpass Butterworth filter. Peaks and onsets were detected, corrected, and interpolated using a cubic spline to obtain systolic blood pressure (SBP) and diastolic blood pressure (DBP) trends . From the SBP and DBP information, events of hypertension (SBP greater than 140 mmHg or DBP greater than 90 mmHg) and hypotension (SBP lower than 90 mmHg or DBP lower than 60 mmHg) were identified. Then, each 5 min segment was labeled as hypertension, normotension, or hypotension according to the most frequent label in each 5 min segment. An example of these trends and labels is shown in Fig. 5.

Fig. 5: Arterial blood pressure signal analysis.
figure 5

Example of the analysis of a 5-min arterial blood pressure (ABP, gray line) signal, with the trends for systolic (SBP, continuous line) and diastolic (DBP, dotted line) blood pressure, as well as the determination of blood pressure state (BP state, dashed line). Hypo hypotension, Normo normotension, Hyper hypertension.

For HRV analysis, R peaks were detected from each 5-min segment obtained from the ECG signals, using the algorithm proposed in ref. 40. HRV was measured as the time difference, in milliseconds, between consecutive R peaks. For the frequency-domain analysis, the uneven HRV series was interpolated using a cubic spline interpolation and a sampling rate of 4 Hz, and the power spectrum was obtained using the Fast Fourier Transform (FFT). Outliers in both the original and interpolated time series were defined as values higher or lower than the mean value plus or minus 1.96 times the standard deviation of the series. These outliers were then replaced with the mean value of the five previous values in the time series. Figure 6a illustrates this process.

Fig. 6: Electrocardiography and photoplethymography analysis for the extraction of heart rate variability and pulse rate variability, respectively.
figure 6

Example of a an electrocardiography (ECG) and b a photoplethysmography (PPG) signal. R peaks (black circles on the ECG signal) were detected from ECG signals to measure heart rate variability (HRV) as the time interval between consecutive R peaks (RR intervals). Onsets (black circles on the PPG signal) were detected from PPG signals to measure pulse rate variability (PRV) as the time interval between consecutive onsets (PP intervals).

Similarly, after segmentation, the onset of each cardiac cycle from the 5-min-long PPG signals was obtained as the intersection point of the tangent lines arising from the maximum slope point and the valley of the waveform. This fiducial point was selected because of its robustness for PRV analysis21,41. PRV was measured as the time difference, in milliseconds, between consecutive onsets from the PPG signal, as shown in Fig. 6b. Similar to the method used for HRV trends, PRV was interpolated using a 4 Hz sampling rate and a cubic spline interpolation, and outliers were detected and corrected. Again, FFT was used to obtain the power spectrum.

As was explained previously, time- and frequency-domain indices and nonlinear indices extracted from Poincaré plot, entropy, phase, and detrended-fluctuation analyses were obtained, as summarized in Table 1.

Statistical analysis

All statistical analyses were performed in MATLAB® and R (version 3.6.1). A significance level of 5% (p value < 0.05) was considered significant for all analyses, and the normality of data was assessed using a Lilliefors test.

The aim of this study was to assess the differences between HRV and PRV indices extracted from critically ill subjects during hypo-, normo-, and hypertensive events. Hence, the level of the linear relationship between PRV and HRV indices was evaluated using the Spearman correlation coefficient. The differences between HRV and PRV were also evaluated using Friedman rank sum tests, and the differences among BP states were assessed using Kruskal–Wallis tests, with pairwise Wilcoxon tests with Bonferroni correction as post hoc analyses.

Moreover, because a good correlation does not imply good agreement, the agreement between HRV and PRV during each of the BP states was assessed using Bland–Altman analysis20. From the Bland–Altman plots, the bias and difference between limits of agreement (LoAs) were obtained, and the ratio of agreement (BAR) was measured using (Eq. 2) and (Eq. 3), as suggested in refs. 8,42. Agreements were categorized as good (BAR ≤ 10%), moderate (10% < BAR ≤ 20%), or insufficient (BAR > 20%).

$${\mathrm{LoA}}=\overline{x}\pm 1.96{\sigma }_{x},x={\rm{HRV}}-{\rm{PRV}}$$
(2)
$${\mathrm{BAR}}=(100 \% )| \frac{1.96{\sigma }_{x}}{\overline{{\rm{HRV}}+{\rm{PRV}}}}| ,x={\rm{HRV}}-{\rm{PRV}}$$
(3)

Reporting Summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.