Introduction

Extubation readiness in extremely preterm infants is a complex medical issue difficult to address with simple solutions.1 Given the lack of accurate predictors, physicians often make decisions about extubation based on their clinical judgment,2 which might partially explain their high failure rates.3 After extubation, the use of non-invasive respiratory support decreases the risk of extubation failure.4 Most common modes used are nasal continuous positive airway pressure (NCPAP) and non-synchronized nasal intermittent positive pressure ventilation (ns-NIPPV). The best choice remains controversial as meta-analyses of several studies showed very low precision and high degree of bias, and the largest RCT was not able to demonstrate any superiority between CPAP or NIPPV.5,6 Synchronization during NIPPV has also been postulated as an important requirement for successful outcomes.7,8 Non-invasive neurally adjusted ventilatory assist (NIV-NAVA) can provide ventilation that is proportional and synchronized to the patient’s respiratory drive using the electrical activity of the diaphragm. Synchronized NIPPV has showed encouraging preliminary findings for reduction of extubation failure rates.9,10,11

Heart rate and respiratory rate are typical vital signs measured during respiratory support, but interpretation of those measurements is often limited to trends or averages over time. More sophisticated analyses of those signals could provide additional information on the individual’s clinical status, physiological response to treatment, and even predict outcomes.12,13,14,15,16,17,18,19,20,21,22 In fact, measurements of heart rate variability (HRV) have been able to identify system instabilities before any clinical signs of deterioration, as demonstrated for the prediction of sepsis12 or outcomes of hypoxic-ischemic encephalopathy.13 More importantly, various cardiorespiratory measures have also been found to be predictive of extubation outcomes in mechanically ventilated preterm infants, including HRV,14,15,16,17,18,19 diaphragmatic activity,20,21 and respiratory variability (RV).21,22

Given that mechanical ventilation is associated with high rates of morbidities, physicians commonly decide towards early disconnection despite the dilemma around extubation readiness. Thus, early identification of infants at increased risk of failure shortly after extubation may also be helpful to identify higher risks patients and maybe guide clinical care. In this study we aim to investigate the association between cardiorespiratory measures performed shortly after extubation and extubation outcomes in infants receiving three common modes of non-invasive respiratory support: NIV-NAVA, NCPAP and NIPPV. Furthermore, we also assessed the predictive ability of those measures to identify successful extubation.

Methods

Population

This is a secondary analysis of a prospective, unblinded, randomized crossover trial from a level III neonatal intensive care unit in Montreal, Canada (ClinicalTrials.gov Identifier: NCT02723123). Between July 2016 and August 2018, the study enrolled infants with birth weight (BW) ≤ 1250 g who were electively extubated for the first time from mechanical ventilation (MV) to non-invasive respiratory support. Infants with neuromuscular disease, major congenital anomalies, diaphragmatic paralysis or palsy, diagnosed phrenic nerve injury, hemodynamic instability, esophageal perforation, congenital heart defects, or receiving inotropes, narcotics, or sedative agents were excluded. All clinical decisions related to pre- and post-extubation ventilatory parameters, extubation, type of post-extubation respiratory support provided outside of the study period, and reintubation were made by the medical team but data was recorded and reported here. As part of institutional practices, all extremely preterm infants were extubated to either NCPAP or non-synchronized NIPPV outside of the study period, and oxygen was titrated to maintain oxygen saturation targets between 88–92%. The unit has guidelines defining non-invasive support failure, and therefore, need of reintubation. Those criteria include: (a) fraction of inspired oxygen (FiO2) consistently > 50% for 1 to 2 h, (b) severe respiratory acidosis: PCO2 > 65 mmHg with a pH < 7.20 in 2 consecutive arterial/venous blood gases, (c) two episodes of apnea requiring bag/mask ventilation due to failure to resume spontaneous breathing, (d) frequent episodes of apnea and/or bradycardia (>6 episodes in 6 h), and (e) significant increase in work of breathing. However, the medical team had the final decision concerning reintubation. Thus, the primary reason for reintubation provided by the medical team was also collected. The study was approved by the institutional research ethics board of the McGill University Health Center and informed consent was obtained from parents or legal guardians.

Study design and data acquisition

Electrocardiogram (ECG) and the electrical activity of the diaphragm (Edi) signals were collected during the study period. Prior to extubation, 3 additional ECG leads were placed on the infants’ limbs (at least 1 cm apart from existing leads to prevent interference) and a NAVA catheter with embedded electrodes (6-French/49 cm specialized feeding tube) was placed at the level of the diaphragm. The catheter connected to the NAVA module of the Servo-i ventilator (Maquet Critical Care, Solna, Sweden) and its correct positioning was confirmed as previously described.23 Immediately following extubation, infants received the type of non-invasive respiratory support prescribed by the medical team (NCPAP or ns-NIPPV) for 30–60 min to allow stabilization before initiating the recordings. After that, infants then received the study interventions (NCPAP, NIPPV, and NIV-NAVA) while in a resting supine position, and for 40 min on each mode. The first 10 min of recordings were a priori decided to be excluded from analysis to provide a wash-out period between modes. The interventions were provided in a random order determined by a computer-generated randomization sequence, generated by author G.S. and implemented by authors S.L., M.B., and W.S. (allocation was not concealed). A timeline of the study design is provided in Supplemental Fig. S1.

To avoid multiple changes, all modes of non-invasive support were delivered by the Servo-i ventilator (Maquet Critical Care, Solna, Sweden) using infant binasal prongs (Hudson RCI, Wayne, Pennsylvania). The positive end-expiratory pressures (PEEP) or NCPAP levels matched the post-extubation stabilization period pressures prescribed by the attending physician. The pressure control mode on the Servo-i ventilator was used to provide NIPPV, with peak inflation pressures (PIP) set to 10 cmH2O above the PEEP at a rate of 20 breaths per minute. If deemed necessary by the research staff or medical team for infant stability, increases in PIP and rates were permitted, with any such deviations recorded. For NIV-NAVA, the NAVA levels were adjusted in attempt to match the PIP levels provided during NIPPV. The pressure waveforms from the Servo-i were analyzed offline to determine the number of PIPs delivered, median PIP value, and MAP during NIPPV and NIV-NAVA. The occurrence of bradycardias (heart rate <100 bpm) and desaturations (SpO2 < 85%) detected by the hospital’s monitoring system were also manually recorded by the research team at the bedside (excluding those during wash-out periods), as these are the events that the medical team responds with any intervention. Changes in the FiO2 were also recorded.

Heart rate variability

Using a portable data acquisition system (FE132 Bio Amp and PowerLab, ADInstruments, Colorado, U.S.A), ECG signals were amplified, anti-alias filtered, sampled at 1000 Hz, and stored in a research computer for offline analysis using the LabChart software (Version 8.1.4, ADInstruments). ECG signals were analyzed using an in-house developed program, with documentation of the algorithm and MATLAB (MathWorks, Massachusetts, U.S.A.) application being publicly available at http://hrvtoolkit.com. Analyses were performed using the full 30-min segment.

For HRV, the following time domain, frequency domain, and non-linear parameters were calculated:24,25

Time domain

Minimum, maximum, and median NN (normal-to-normal, or R-wave-to-R-wave) interval; standard deviation of the NN intervals, i.e., the variation of intervals measured between consecutive sinus beats (SDNN); the standard deviation of the averages of NN intervals in all 5-min segments of the recording (SDANN); the mean of the standard deviations of all NN intervals for all 5-mine segments of the recording (SDNNi); the percentage of adjacent NN intervals differing by >50 ms (pNN50); root mean square of successive differences of NN intervals (RMSSD); total number of all NN intervals divided by the height of the 32-bin histogram of all NN intervals (triangular index; HRVTi); and the baseline width of the minimum square difference triangular interpolation of the highest peak of the 32-bin histogram of all NN intervals (TINN).25

Frequency domain

TP: total power (<0.4 Hz), VLF: very low frequency (<0.04 Hz), LF: low frequency (0.04–0.15 Hz), HF: high frequency (>0.15 to <0.4 Hz), and LF/HF ratio using the Welch periodogram.25

Non-linear

Sample entropy (SampEn; r = 0.1, m = 3), a measure of chaos or disorder in the NN series; detrended fluctuations analysis (DFA) scaling exponents alpha1 (short-term; 4–13 beats), and alpha2 (long-term; 14–100 beats), measures of self-similarity, or repeated patterns when inspected on different scales within the NN series; the standard deviation of intervals perpendicular to the identity line of the Poincaré plot (SD1), a measure of short-term variability, and the standard deviation of intervals along the identity line (SD2), a measure of overall variability.24

Respiratory analysis

Edi signals were acquired at a sampling frequency of 100 Hz and stored for offline analysis using the Servo Tracker software (Maquet Critical Care, Solna, Sweden). For each patient and mode, the full 30-min segment of Edi signals were analyzed using an automated algorithm developed on MATLAB (R2017a, MathWorks, Natick, U.S.A.), with peak detection functions using minimum amplitude of 1 μV and at least 0.5 s apart. An example of the Edi analysis is provided in Supplemental Fig. S2.21 The median values of the following parameters were calculated and used for comparisons:

Diaphragmatic activity

The entire Edi signal was used to calculate: (a) minimum value of Edi (Edi min), i.e., the resting tonic state; (b) maximum value of Edi (Edi max), i.e., the maximal inspiratory effort; and (c) area under the entire signal (Edi area), i.e., an overall assessment of respiratory effort. A breath-by-breath analysis was used to calculate: (a) breath area, i.e., area under the breath curve; (b) breath amplitude, i.e., the minimum vertical distance that the signal descends on either side of the peak (peak-to-trough distance); (c) breath width, i.e., the distance between the points to the left and right of the peak at half the amplitude; (d) neural inspiratory time (NTi), i.e., the time from trough to peak; and (e) neural expiratory time (NTe), i.e., the time from peak to trough.

Respiratory variability

Breath-to-breath (BB) time intervals were obtained and the following measures of variability were computed for breath areas, NTi, NTe, breath amplitudes, and widths (obtained through the diaphragmatic activity breath-by-breath analysis described above): standard deviation (SD), coefficient of variation (CV), and the standard deviation of the successive differences (SDSD).21

Clinical data

The following baseline patient demographics were prospectively collected: BW, gestational age, sex, APGAR score at 5 min, delivery room intubation, age at intubation, surfactant use, weight at extubation, postnatal age and post-menstrual age at extubation, days on MV, pre-extubation blood gas (pH, partial pressure of CO2, bicarbonate, base excess) and ventilator settings (ventilation mode, PIP, PEEP, mean airway pressure (MAP), FiO2, and inflation rate), and non-invasive respiratory support provided after the study (mode, PIP, PEEP, and inflation rates). Extubation failure was defined as reintubation within 7 days of extubation, as this seems to be an optimal timeframe to capture as many respiratory-related reasons for failure while excluding most non-respiratory-related failures.26

Sample size and statistical analysis

For the crossover trial, a convenience sample size of 30 infants was chosen, with an expected loss of patients up to 20–25% for poor signal quality, technical issues, clinical instability, or need of immediate reintubation. (4,5) Given that this was a post hoc analysis, there was no pre-determined sample size based on extubation outcomes. All cardiorespiratory and clinical variables are expressed as median [IQR] or number (%) and were compared using the Wilcoxon rank sum test, or the Chi-square or Fisher’s exact tests. Given the primary results of this crossover trial demonstrating significant differences in cardiorespiratory variability measures between the modes of support,27 success and failure groups were compared while on each mode separately. Statistics were computed using MATLAB (R2018b, MathWorks, Massachusetts, U.S.A.) and a p-value < 0.05 was considered significant. Significant parameters were further evaluated for their ability to identify extubation success by calculating the area under the receiver-operator characteristics curves (AUCROC), sensitivity and specificity.

Results

A total of 30 infants were studied and 23 (77%) had full data recordings for analysis; a flow diagram of patient recruitment is provided in Supplemental Fig. S3. Fourteen infants were successfully extubated while 9 failed, yielding a 39% reintubation rate. All failure infants were reintubated for the primary reason of apneas and bradycardias according to the medical clinical team. The median time at reintubation was 1.6 days (37 h) after extubation, with a range from 0.6 to 4.8 days (13 to 116 h). In this high-risk group of infants for extubation failure, there were no significant differences between success and failure for any pre-extubation clinical variable. Patient demographics are presented in Table 1.

Table 1 Patient demographics.

All settings during the study period for each mode are provided in Table 2. Some individualized setting changes were made in 2 infants based on clinical judgment: (1) PIP/PEEP changed to 20/8 and rate of 40 ipm during NIPPV, and (2) rate increased to 40 ipm during NIPPV. There were no significant differences in the minimum or maximum FiO2 provided, or presence of desaturation or bradycardia events between the three modes (Table 2).

Table 2 Study settings.

Heart rate variability

Compared to infants successfully extubated, VLF power was significantly higher in the failure group during NIV-NAVA (174.6 [108.6–1252.4] vs. 70.6 [48.9–269.2] ms2, p = 0.041; Fig. 1). Also, failure infants had significantly lower SampEn during NCPAP (0.8 [0.6–0.9] vs. 1.5 [0.8–1.7], p = 0.030; Fig. 1) and NIPPV (0.7 [0.6–0.9] vs. 1.1 [0.9–1.6], p = 0.030) when compared to success infants, with no differences during NIV-NAVA mode. Complete HRV results are provided as Supplemental Table S1. Both variables had a moderate predictive value for early detection of extubation failure, with an area under the receiver-operator characteristics (AUCROC) of 0.76 for VLF power during NIV-NAVA, and 0.78 for SampEn during both NCPAP and NIPPV (Table 3). Sensitivities for these two HRV parameters ranged from 64.3% to 85.7%, and specificities from 66.7% to 88.9% (Table 3).

Fig. 1: Heart rate variability, modes of non-invasive respiratory support, and extubation outcomes.
figure 1

Extubation success group is represented in blue (left box plot) and failures in red (right box plot) for each mode. VLF very low frequency, NCPAP nasal continuous positive airway pressure, NIPPV nasal intermittent positive pressure ventilation, NIV-NAVA non-invasive neurally adjusted ventilatory assist, red cross = outliers, *success vs. failure p < 0.05 by Wilcoxon Rank Sum test.

Table. 3 Receiver-operator characteristics (ROC) for the identification of successfully extubated infants.

Diaphragmatic activity and respiratory variability

There were no significant differences in diaphragmatic activity noted between success and failure infants, or for subgroups of success and failure infants between each mode. Complete diaphragmatic activity results are provided as Supplemental Table S2.

Infants who failed extubation had significantly higher respiratory variability in breath areas, breath amplitudes, and NTe (Fig. 2). For the breath areas, failure infants had higher SDArea during NCPAP (372 [343–504] vs. 232 [141–409] uV•s, p = 0.047) and NIV-NAVA (454 [394–634] vs. 258 [157–462] uV•s, p = 0.041) and SDSDArea during NCPAP (505 vs. 294 uV•s, p = 0.047). For the breath amplitudes, failure infants had higher SDAmp during NCPAP (8.09 [6.35–8.55] vs. 4.74 [3.00–7.32] μV, p = 0.035) and NIV-NAVA (7.69 [7.54–8.41] vs. 4.37 [3.15–6.64] μV, p = 0.018), and SDSDAmp (10.28 [9.07–11.94] vs. 5.83 [3.95–8.47] μV, p = 0.047) during NIV-NAVA. For the NTe, failure infants had significantly higher CVNTe (0.71 [0.61–0.88] vs. 0.51 [0.48–0.67], p = 0.047). Complete respiratory variability results are provided as Supplemental Table S3. Respiratory variability parameters also had AUCROC values ranging from 0.75 to 0.80, with sensitivities from 64.3% to 78.6% and specificities from 77.8% to 100% (Table 3).

Fig. 2: Respiratory variability, modes of non-invasive respiratory support, and extubation outcomes.
figure 2

Extubation success group is represented in blue (left box plot) and failures in red (right box plot) for each mode. SD standard deviation, CV coefficient of variation; SDSD standard deviation of successive differences, NTe neural expiratory time, NCPAP nasal continuous positive airway pressure, NIPPV nasal intermittent positive pressure ventilation, NIV-NAVA non-invasive neurally adjusted ventilatory assist, *success vs. failure p < 0.05 by Wilcoxon Rank Sum test.

Discussion

In this secondary analysis of a randomized crossover trial, infants who subsequently succeeded or failed their extubation attempt demonstrated significant differences in cardiorespiratory variability (HRV and RV) shortly after extubation while receiving NCPAP, NIPPV, and NIV-NAVA in a random order. Although the significant differences were observed during certain modes of respiratory support, similar trends were noted during all modalities, with higher RV and VLF power (HRV), and lower sample entropy (HRV) observed in infants that failed their extubation, with a moderate accuracy in predicting successful extubation. Interestingly, different physiological responses to the tested modes of non-invasive respiratory support between those who fail vs. succeed extubation were noted even though no differences were observed in clinical stability or oxygen requirements during the recordings.

Heart rate variability

HRV is being increasingly explored as an extubation outcome predictor in preterm infants, with significant differences found both pre- and post-extubation.14,15,16,17,18,19 During the post-extubation period, heart rate characteristics, a composite measure including SDRR and SampEn, was found to be increased (indicating a reduction in HRV) in patients that went on to fail their extubation attempt.15 Furthermore, similar studies conducted by our research group have found different responses between successful and failed extubation attempts during different modes of non-invasive respiratory support.18,19 Indeed, in the present study, we found a significant increase in VLF power during NIV-NAVA, with similar trends observed during the other modes, and significant reductions in SampEn during NCPAP and NIPPV in the failure group. While the physiological significance of VLF power is not well defined and its interpretation is limited,25 studies in newborns have found changes in VLF with respect to thermoregulation,28 maturation,29,30 and sleep states.31 These results contradict previous findings, where sicker infants typically have reduced VLF (and other frequency ranges), as shown in newborns infected with respiratory syncytial virus32 and with increasing severity of hypoxic-ischemic encephalopathy.33 This raises the possibility that synchronization during non-invasive support was able to affect and uncover an ANS dysfunction amongst infants that went on failing. Nevertheless, given its unclear physiological significance, interpretations of VLF power are speculative and require further investigations.

In contrast, SampEn has demonstrated greater clinical utility. SampEn is a measure of regularity, chaos, and unpredictability, with a zero value indicating a perfectly regular signal and a value of 2 indicating a completely random signal. While previous studies have found changes in SampEn in neonates with positioning,34,35,36 pain,37,38 and maturational changes,35,39 it’s been most useful in the early identification of sepsis. Integrated into heart rate characteristics monitoring, reductions in SampEn have been predictive of impending sepsis in preterm infants, which has been characterized by low baseline variability with transient decelerations.40,41,42 While the physiological mechanism for this is unclear,41 this is in line with our findings of reduced SampEn in the failure group. Interestingly, both VLF and SampEn had moderate strength in predicting extubation outcomes (AUCROC 0.76–0.78) shortly after extubation.

Respiratory variability

Changes in RV, both increases and decreases, have been associated with worse respiratory and extubation outcomes.21,22,43,44 An increased RV for multiple parameters of respiratory depth and timing was observed in the failure group during NCPAP and NIV-NAVA, with a similar trend during NIPPV. Unstable breathing patterns will increase RV, as has been observed in infants with sepsis and necrotizing enterocolitis where instability was characterized by increased apneas, pauses, or periodic breathing.45,46 Indeed, RV was evaluated in extremely preterm infants receiving mechanical ventilation and a period of endotracheal CPAP prior to extubation and a significantly increased RV was also noted in the failure group during both types of support.21 It should be noted that 11/13 (85%) infants from that study were also included in the present study, making the general increase in RV of the failure group consistent from the pre- to the post-extubation period. Like HRV, RV parameters also had moderate strength in predicting extubation outcomes, with AUCROC ranging from 0.75–0.80. Thus, future studies attempting to predict extubation outcomes should consider incorporating measures of cardiorespiratory variability to identify a higher risk population likely to benefit from any intervention.

Study limitations and strengths

This study has some limitations. As a secondary analysis comparing two subgroups, the sample sizes within each group were small, increasing the risk of Type 1 and Type 2 errors, and multiple comparative analyses were performed. Given the exploratory nature of this analysis, these findings should be interpreted with caution but encourage future and larger studies, assisting in their design planning and parameter selection. Larger sample sizes may have provided greater insight into how the subgroups responded to the different modes of support. Seven patients (23%) were lost due to NCPAP failure or technical issues. The presence of desaturations and bradycardia events were manually recorded as free text, lacking specific information on the frequency and duration of these events. Exposure times were 40 min on each mode of support, and it is unclear what changes may have occurred with longer exposure times, such as clinical adaptations or deteriorations. The definition of extubation failure was reintubation within 7 days; however, a different timeframe could have potential effects on the predictive ability of the cardiorespiratory analyses. Future studies with larger sample sizes should explore the effect of shorter and longer definition timeframes. Furthermore, it is unclear if these findings are generalizable beyond the first extubation attempt to subsequent extubation attempts in the event of failure. The NIV-NAVA levels provided in this study were selected in an attempt to match the PIP delivered to the infant while on NIPPV. Interestingly, the levels needed for that were similar to the ones used during regular clinical care in our unit. This study also has some important strengths. It collected cardiorespiratory signals in a high-risk (39% failure rate) group of extremely preterm infants during the critical time period shortly after extubation. Three commonly used modes of non-invasive respiratory support were tested on each patient in a random order and a comprehensive analysis of cardiorespiratory signals were performed. The cardiorespiratory variability differences between successful and failed extubations had similarities across all three modes, indicating a potential generalizability of the findings regardless of the non-invasive respiratory support modality used; however, this requires validation in a larger study over a longer study period.

Conclusion

In a high-risk cohort of extremely preterm infants, measures of cardiorespiratory variability performed shortly after extubation were significantly different between infants who subsequently succeed or fail. Indeed, HRV and RV had moderate predictive accuracy for extubation success, demonstrating potential to be used as biomarkers for early identification of infants at higher risk of failure. However, as a secondary, exploratory study with a small sample size, larger studies are needed to validate the findings. Future investigations using practical tools, able to yield cardiorespiratory variability results within a few minutes or hours, may improve its prognostic ability, which will be important for individually guided therapies in trials or clinical practice.