Introduction

The spread of the pandemic caused by the coronavirus SARS-CoV-2 has placed health care systems around the world under enormous pressure. Up to 10–15% of hospitalized cases with coronavirus disease 2019 (COVID-19) are in critical condition (i.e., severe pneumonia and hypoxemic acute respiratory failure, HARF), have received invasive mechanical ventilation, and are admitted to the intensive care unit (ICU)1,2. The shortage of critical care resources, both in terms of equipment and trained personnel, required a reorganization of the hospital facilities even in developed countries. In addition, some COVID-19 patients cannot be considered for invasive ventilation due to their frailty or comorbidities, and others are unwilling to undergo invasive techniques. As a result, a considerable proportion of severe patients are being treated in hospital settings outside the ICU.

Noninvasive respiratory support (NIRS) techniques, including high-flow oxygen administered via nasal cannula (HFNC), continuous positive airway pressure (CPAP) and noninvasive ventilation (NIV), have been used in severe COVID-19 patients, although their use was initially controversial due to doubts about its effectiveness3,4,5,6, and the risk of aerosol-linked infection spread7. Initial recommendations8,9,10,11,12 were based on previous evidence in non-COVID patients and early experience during the pandemic, but they differed in terms of the type of NIRS proposed as first option, and lacked COVID-specific evidence to support them.

So far, observational COVID-19 studies have suggested that either HFNC, CPAP or NIV may improve oxygenation and reduce the need for intubation or the risk of death13,14,15,16,17,18, but the effects of different NIRS techniques have been compared in few studies16,19,20. An observational study analyzing 670 patients found no differences in 30-day mortality or endotracheal intubation between HFNC, CPAP and NIV used outside the ICU, after adjusting for confounders16. In contrast, a randomized study of 110 COVID-19 patients admitted to the ICU found no differences in the 28-day respiratory support-free days (primary outcome) or mortality between helmet NIV and HFNC, but recorded a lower risk of endotracheal intubation with helmet NIV (30%, vs. 51% for HFNC)19.

Overall, the information supporting the choice of one or other NIRS technique is limited. Moreover, the COVID-19 pandemic is still active around the world, and data supporting an evidence-based choice of NIRS are urgently needed. In this multicentre, observational real-life study, we aimed to compare the effects of high-flow oxygen administered via nasal cannula, continuous positive airway pressure, and noninvasive ventilation, initiated outside the intensive care unit, in preventing death or endotracheal intubation at 28 days in patients with COVID-19.

Methods

Study design

A multicentre, retrospective cohort study of COVID-19 patients followed from NIRS initiation up to 28 days or death, whichever occurred first.

Study population

We included a consecutive sample of patients aged at least 18 years who had initiated NIRS treatment for HARF related to COVID-19 pneumonia outside the ICU at any of the 10 participating university hospitals, during the first pandemic surge, between 1 March and 30 April 2020. All participating hospitals belong to the National Health System of Catalonia, Spain, and attend a population of around 4.3 million inhabitants. COVID-19 diagnosis was confirmed through reverse-transcriptase-polymerase-chain-reaction assays performed on nasopharyngeal swab specimens. From a total of 419 candidate patients, we excluded those with: (1) respiratory failure not related to COVID-19 (e.g., cardiogenic pulmonary edema as primary cause of respiratory failure); (2) rejection or early intolerance to any NIRS treatment; (3) pregnancy; (4) nosocomial infection; and (5) PaCO2 above 45 mm Hg. A total of 367 patients were finally included in the study (Fig. 1), which was approved by the research ethics committee at each participating hospital (study coordinator centre, Hospital Vall d'Hebron, Barcelona; protocol No. PR(AG)265/2020). Research was performed in accordance with the Declaration of Helsinki. The requirement of informed consent was waived due to the retrospective nature of the study.

Figure 1
figure 1

Flowchart. ARF acute respiratory failure, HFNC high-flow nasal cannula, ICU intensive care unit, NIRS non-invasive respiratory support, NIV non-invasive ventilation. *HFNC, n = 2; CPAP, n = 6; NIV, n = 3. In addition, 26 patients who presented early intolerance were treated subsequently with other NIRS treatment, and were included as study patients in this second treatment: 8 patients with intolerance to HFNC (2 patients treated subsequently with CPAP, and 6 with NIV), 11 patients with intolerance to CPAP (5 treated later with HFNC, and 6 with NIV), and 7 patients with intolerance to NIV (5 treated after with HFNC, and 2 with CPAP).

Treatment strategies

The NIRS treatments evaluated were high-flow oxygen administered via nasal cannula (HFNC), continuous positive airway pressure (CPAP), and noninvasive ventilation (NIV). According to current Spanish recommendations8, criteria for initiating respiratory support were moderate to severe dyspnoea, respiratory rate > 30 bpm, or PaO2/FiO2 < 200 mmHg, screened either at hospital admission or ward admission. The decision regarding the choice of treatment was taken by the pulmonologist in charge of the patient’s care, with HFNC usually as the first step after the failure of conventional oxygen therapy8, and taking into account the availability of NIRS devices at each centre.

In the HFNC group, heated and humidified oxygen was applied through nasal prongs, at an initial flow rate of 50–60 lpm if tolerated. CPAP was initially set at 8–10 cm H2O and then adjusted according to tolerance and clinical response. In the NIV group, a pressure support ventilator mode was adjusted; a high positive end-expiratory pressure (PEEP) and a low support pressure were used to set a tidal volume < 9 ml/kg of predicted body weight8. NIRS treatments were applied continuously for at least 48 h while controlling oxygen delivery to obtain a target oxygen saturation measured by pulse oximetry (SpO2) of 92–96%21. In the NIV and CPAP groups, if the treatment was not tolerated continuously, a minimal duration of 8 h per day, predominantly during the night, was attempted, reaching a mean usage of 22 (4) h/day in NIV and 21 (4) h/day in CPAP (min-P25-median-P75-max 8-22-24-24-24 in both groups). HFNC was not used during breaks in the NIV or CPAP groups due to the limited availability of devices in the first wave of the pandemics. In order to minimize the risks of infection to staff, we applied NIV and CPAP treatments through oronasal or total face non-vented masks attached to single-limb circuits with intentional leak, and placing a low-pressure viral filter preventing exhaled droplet dispersion; in HFNC-treated patients, a surgical mask was put over the nasal prongs8,9.

Parallel to the start of NIRS, the ceiling of care was determined considering the patient’s wishes (or those of their representatives), underlying comorbidities, and frailty22. A do-not-intubate order was established at the discretion of the attending physician, after discussion with the critical care physician. In case of doubt, the final decision was discussed by the ethical committee at each centre. Intubation was performed when clinically indicated based on the judgment of the responsible physician.

In addition to NIRS treatment, conscious pronation was performed in some patients. Patients were treated and monitored continuously in adapted respiratory wards, with improved monitoring and increased nurse-patient ratio (1:4 to 1:6 in wards, and from 1:2 to 1:4 in high-dependency units).

Patients’ characteristics

Study data were collected and managed using REDCap electronic data capture tools hosted at ISGlobal (Institut de Salut Global, Barcelona)23. We obtained patients’ data from electronic medical records using a modified version of the standardized International Severe Acute Respiratory and Emerging Infection Consortium (ISARIC) COVID-19 case report forms24, including: (i) demographics (age, sex, ethnicity); (ii) smoking status; (iii) chronic conditions (cardiac disease, respiratory disease, kidney disease, neoplasm, dementia, obesity, neurological conditions, liver disease, diabetes, and a modified Charlson comorbidity index)25; (iv) symptoms at admission and physical signs at NIRS initiation (days since the onset of COVID-19 symptoms, temperature, heart rate, systolic and diastolic blood pressure, respiratory rate, and Quick Sequential Organ Failure Assessment (qSOFA) score)26; (v) arterial blood gases at NIRS initiation (PaO2/FIO2 ratio calculated for patients with available PaO2, and imputed from SpO2 for the 33% of patients without PaO2)27; (vi) laboratory blood parameters at NIRS initiation; (vii) chest X-ray findings (unilateral or bilateral pneumonia); and (viii) treatment received during admission (highest level of care received outside ICU, ICU admission, NIRS as ceiling of treatment, awake prone positioning, and drug treatments).

Study outcomes

The primary outcome was treatment failure, defined as endotracheal intubation or death within 28 days of NIRS initiation. Secondary outcomes were 28-day mortality, endotracheal intubation at day 28, in-hospital mortality, and duration of hospital stay.

Statistical analysis

With an expected frequency of 50% for intubation or death in patients with HARF and treated by NIRS28, 300 patients were needed in order to detect a significant difference greater than 20% between the types of NIRS evaluated in the present study, with an alpha risk of 0.05 and a statistical power of 80%.

Characteristics of the patients at baseline according to NIRS treatment were described by mean and standard deviation, median and 25th and 75th percentiles (P25 and P75) and by absolute and relative frequencies, and compared using Chi2, Anova and Kruskal Wallis tests. Given the small number of missing information and that missing were considered at random, we conducted a complete case approach.

Kaplan–Meier curves described the crude event-free rate in each NIRS group and were compared by means of the log-rank test. Multivariable Cox proportional-hazards regression models were used to estimate the hazard ratios (HR) for patients treated with NIV and CPAP as compared to HFNC (the reference group), adjusting for age, sex, and variables found to be significantly different between treatments at baseline (hospital, date of admission and sleep apnea). D-dimer levels and respiratory rate at baseline were also significantly associated with treatment, but since they had missing values for 82 and 41 patients respectively, these variables were only included in a sensitivity analysis.

To account for the potential effect modification, analyses were stratified according to hypoxemia severity (moderate-severe: PaO2/FIO2 < 150 mm Hg; mild-moderate: PaO2/FIO2 ≥ 150 mm Hg)4. To assess the potential impact of NIRS treatment settings, we compared outcomes within NIRS-group according to: flow in the HFNC group (> 50 vs. ≤ 50 L/min), pressure in the CPAP group (> 10 vs. ≤ 10 cm H2O), and PEEP in the NIV group (> 10 vs. ≤ 10 cm H2O).

Sensitivity analyses included: (1) repeating models excluding patients who changed their initial NIRS treatment during the course of the hospitalization to another NIRS treatment (crossover, n = 44); (2) excluding patients with missing measured PaO2/FIO2 (n = 123); (3) excluding patients receiving NIRS as ceiling of treatment (n = 140); and (4) additionally adjusting models for, one at a time, D-dimer levels, respiratory rate, systemic corticosteroid use and Charlson index.

All analyses were performed using StataCorp. 2019. Stata Statistical Software: Release 16. College Station, TX: StataCorp LLC.

Consent for publication

All authors have approved the submission and provide consent to publish.

Results

Patients' characteristics

Among the 367 patients included in the study, 155 were treated with HFNC (42.2%), 133 with CPAP (36.2%), and 79 with NIV (21.5%).

Most patients were male (72%), and the mean age was 67.5 years (SD 11.2). Chronic conditions were frequent (35% of the sample had a Charlson comorbidity index ≥ 2) and did not differ between NIRS treatment groups, except for sleep apnea (more common in the NIV-treated group, Table 1 and Table S1). At the initiation of NIRS, patients had moderate to severe hypoxemia (median PaO2/FIO2 125.5 mm Hg, P25-P75: 81–174). Clinical severity and laboratory values were well balanced between the groups (Table 2 and Table S2), except for respiratory rate (higher in patients treated with NIV).

Table 1 Patients’ baseline characteristics, according to non-invasive respiratory support group.
Table 2 Patients’ characteristics at the time of initiating non-invasive respiratory support.

Treatments

The NIRS treatments applied were not equally distributed among participating hospitals, although HFNC or CPAP were the first NIRS treatment choice at all centers (Table S1). Differences were also found in the NIRS treatments applied according to the date of admission: HFNC was the most frequent treatment early in the period (before 23 March), while CPAP was the most frequent choice in the second and the third periods (Table 1, p = 0.008). Noninvasive respiratory support treatments were applied as ceiling of treatment in 140 patients (38%) (Table 3). Table S3 shows the NIRS settings.

Table 3 Inpatient characteristics and treatments according to non-invasive respiratory support group.

Primary and secondary outcomes

The cumulative percentage of patients who had received intubation or who had died by day 28 (primary outcome) was 45.8% in the HFNC group, 36.8% in the CPAP group, and 60.8% in the NIV group (Fig. 2a). After adjustment, and taking patients treated with HFNC as reference, patients who underwent NIV had a higher risk of intubation or death at 28 days (HR 2.01, 95% CI 1.32–3.08), while those treated with CPAP did not present differences (HR 0.97, 95% CI 0.63–1.50) (Table 4).

Figure 2
figure 2

The 28-days Kaplan Meier curves from: (a) day starting NIRS to death or intubation; (b) day starting NIRS to intubation; and (c) day starting NIRS to death. NIRS non-invasive respiratory support.

Table 4 Outcomes by non-invasive respiratory support group.

As for secondary outcomes, patients treated with NIV had a significantly higher risk of endotracheal intubation, 28-day mortality, and in-hospital mortality than patients treated with HFNC, while no differences were observed between CPAP and HFNC (Fig. 2b,c, Table 4). A total of 73 patients (20%) were intubated during the hospitalization. Among them, 22 (30%) died within 28 days (5/36 in HFNC (14%), 5/14 in CPAP (36%), and 12/23 in NIV (52%) groups, p = 0.007). The patients who had died by day 28 were 117 (31.9%), 91 (65%) of those patients were treated with NIRS as ceiling of treatment and 26 (11.5%) were treated with NIRS not regarded as ceiling of treatment. Days between NIRS initiation and intubation (median (P25-P75) 3 (1–5), 3.5 (2–7), and 3 (3–5), for HFNC, CPAP, and NIV groups respectively; p = 0.341) and the length of hospital stay did not differ between groups (Table 4). Outcomes by hospital are listed in Table S4.

Stratified and sensitivity analyses

In patients with mild-moderate hypoxaemia, CPAP, but not NIV, treatment was associated with reduced outcome risk compared to HFNC (Table S5). The analyses excluding patients with missing PaO2/FIO2 or receiving NIRS as ceiling of treatment showed similar associations to those observed in the main analysis (Tables S6 and S7, respectively). No differences were found when we performed within NIRS-group comparisons according to settings applied (Table S8).

During the follow-up period, 44 patients (12%) switched to another NIRS treatment: eight (5%) in the HFNC group (treated subsequently with NIV), 28 (21%) in the CPAP group (13 switched to HFNC, and 15 to NIV), and eight (10%) in the NIV group (seven treated with HFNC, and one with CPAP). Excluding these patients showed no relevant changes in the associations observed (Table S9). Additional adjustment for D-dimer, respiratory rate, Charlson index, or treatment with systemic corticosteroids produced very similar results (Table S10).

Discussion

This study shows that noninvasive ventilation initiated outside the ICU for the treatment of hypoxemic acute respiratory failure secondary to COVID-19 resulted in higher mortality or intubation rate at 28 days (i.e., treatment failure) than high-flow oxygen or CPAP. These results were robust to a number of stratified and sensitivity analyses.

Most previous data on the effectiveness of NIRS treatments in severe COVID-19 patients came from studies which had limited sample sizes and were not designed to compare the different techniques13,14,15,17,18. In the only available study (also observational) comparing NIV, HFNC and CPAP outside the ICU16, conducted in Italy, the authors did not find differences between treatments in mortality or intubation at 30 days. The discrepancy between these results and ours may be due to differences in the characteristics of the patients included. First, in the Italian study, the mean PaO2/FIO2 ratio was 152 mm Hg, suggesting a less severe respiratory failure than in our patients (125 mm Hg). In the stratified analysis of our cohort, planned a priori, patients with a PaO2/FIO2 ratio above 150 responded similarly to HFNC and NIV treatments, suggesting that the severity of the hypoxemia might predict the success of NIV, as previously reported in non-COVID patients4,28,29. Second, the Italian study did not provide data on PaCO2, meaning that the improvements with NIV might have been attributable to the inclusion of some patients with hypercapnic respiratory failure, who were excluded in our study.

Recently, the effectiveness of CPAP or HFNC compared with conventional oxygen therapy was assessed in the RECOVERY-RS multicentric randomized clinical trial, in 1,273 COVID-19 patients with HARF who were deemed suitable for tracheal intubation if treatment escalation was required20. In this study, the requirement of intubation or mortality within 30 days (primary outcome) was significantly lower with CPAP (36%) than with conventional oxygen therapy (45%; absolute difference, − 8% [95% CI, − 15% to − 1%], p = 0.03). This improvement was mostly driven by a reduction in the need of intubation, but no differences in mortality were seen (16.7% vs 19.2%, respectively). No significant differences in the main outcome were found between HFNC (44%) vs conventional oxygen therapy (45%; absolute difference, − 1% [95% CI, − 8% to 6%], p = 0.83). The main difference in respect to our study was the better outcomes of CPAP compared with HFNC. However, the RECOVERY-RS study may have been underpowered for the comparison of HFNC vs conventional oxygen therapy due to early study termination and the number of crossovers among groups (11.5% of HFNC and 23.6% of conventional oxygen treated patients).

There are several possible explanations for the poor outcome of COVID-19 patients undergoing NIV in our study. First, NIV has been reported to produce overdistension, compounded by the respiratory effort itself30, which could result in ventilation-induced lung injury due to the excessive increases in tidal volumes28,31. This risk would be avoided in CPAP and HFNC because they improve oxygenation without changing tidal volume32,33. Unfortunately, tidal volume measurements during NIV were not available in our study to support or reject this hypothesis. Second, patient-ventilator asynchronies might have arisen in NIV-treated patients making more difficult their management outside the ICU setting and thereby explaining, at least partially, their worse outcomes. Third, a bench study has recently reported that some approaches to minimize aerosol dispersion can modify ventilator performance34. In short, the addition of intentional leaks, as in our study, led to a lower maximal pressure without a significant impact on the work of breathing and without increasing patient-ventilator asynchronies34. Then, in the present work, we believe that the availability of trained pulmonologists to adjust ventilator settings may have overcome this aspect. Fourth, non-responders to NIV could have suffered a delay in intubation, but in our study the time to intubation was similar in the three NIRS groups, thus making this explanation less likely. Fifth, we cannot exclude the possibility that NIV implied a more complicated clinical course than HFNC or CPAP. Patients undergoing NIV may require some degree of sedation to tolerate the technique, but unfortunately we have no data on this regard. Furthermore, NIV and CPAP may impair expectoration which could contribute to bacterial infections, although this hypothesis remains unknown with the present data. Finally, we cannot rule out the possibility that NIV was tolerated worse than HFNC or CPAP, which would have reduced adherence and lowered the effectiveness of the therapy. However, the number of patients abandoning their original treatment was nearly twice as high in the CPAP group than in the NIV group.

In the treatment of HARF with CPAP or NIV the interface via which these treatments are applied should be considered, since better outcomes have been reported with a helmet interface than with face masks in non-COVID patients6,35 , possibly due to a greater tolerance of the helmet and a more effective delivery of PEEP36. As noted above, a single randomized study has evaluated helmet NIV against HFNC in COVID-1919, and, in spite of the lower intubation rate in the helmet NIV group, no differences in 28-day mortality were registered. More studies are needed to define the place of treatment with helmet CPAP or NIV in respiratory failure due to COVID-19, together with other NIRS strategies. In our study, CPAP and NIV treatments were applied via oronasal and full face masks, reflecting the fact that most hospitals in our country have little experience with the helmet interface.

Our study supports several guidelines37,38 that favor HFNC and CPAP over NIV for the treatment of HARF in COVID-19 patients, but to our knowledge no previous data have been published in support of this recommendation. Furthermore, our results suggest that the severity of the hypoxemic respiratory failure might help physicians to decide which specific NIRS technique could be better for a patient. However, the retrospective design of our study does not allow establishing a causative link between NIV and the worse clinical outcomes observed. Obviously, reaching a definitive conclusion on this point will require further studies with better phenotypic characterization of patients, and considering additional factors implicated in the response to therapies such as the interface used or the monitoring of the inspiratory effort.

This study has some limitations. First, the observational design could have resulted in residual confounding by selection bias. However, the inclusion of patients was consecutive and the collection of variables was really comprehensive. Moreover, NIRS treatment groups exhibited only minor differences which were accounted for in the multivariable and sensitivity analyses thus minimizing the selection bias risk. Although treatment received and outcomes differed by hospital, this fact was taken into account through adjustment. Second, we must be cautious before extrapolating our results to other nonemergency situations. Our study was carried out during the first wave of the pandemics when the healthcare system was overwhelmed and many patients were treated outside ICU facilities. As mentioned above, NIV might have better outcomes in a more controlled setting allowing an optimal critical care39. However, the scarcity of critical care resources has remained along the different pandemic surges until now and this scenario is unfortunately frequent in other health care systems around the world. Thus, we believe that our results may be useful for a great number of physicians treating COVID-19 patients around the world. Third, crossovers could have been responsible for differences observed between NIRS treatments but their proportion was small (12%) and our results did not change when these patients were excluded. Fourth, it could be argued that changes in treatment strategies over the timeframe of the study may have led to differential effects of the NIRS. Nevertheless, we do not think it may have influenced our results, because analyses were adjusted for relevant treatments such as systemic corticosteroids40 and included the time period as a covariate. And finally, due to the shortage of critical care ventilators at the height of the pandemic, some patients were treated with home devices with limited FiO2 delivery capability and, therefore, could have been undertreated41,42. However, as more home devices were used in the CPAP group (81.6% vs. 38% in the NIV group; Table S3), and better outcomes were recorded in the CPAP-treated patients, our result do not support this concern.

The main strength of this study is, in our opinion, its real-life design that allows obtaining the effectiveness of these techniques in the clinical setting. In the current situation with few available data from randomized control trials regarding the best choice to treat COVID-19 patients with noninvasive respiratory support, data from real-life studies like ours may be appropriate43. These data are complementary and still useful later on by including some patients usually excluded from randomized studies; patients with do-not-intubate orders are an example and, obviously, they represent a challenge for the physician responsible to decide the best therapeutic strategy.

Conclusions

In conclusion, the present real-life study shows that, in the context of the pandemic and outside the intensive care unit setting, noninvasive ventilation for the treatment of hypoxemic acute respiratory failure secondary to COVID-19 resulted in higher treatment failure than high-flow oxygen or CPAP. These findings may be relevant for many physicians elsewhere since the successive pandemic surges result in overwhelmed health care systems, leading to the need for severe COVID-19 patients to be treated out of critical care settings.