Introduction

In patients with chronic obstructive pulmonary disease (COPD) who have emphysema, decreasing lung elastic recoil pressure without changes in the elastic properties of the chest wall and increasing airway resistance leads to increased time for lung emptying1. Insufficient exhalation causes increased operating lung volumes and progressive air trapping, resulting in dyspnea2. Pulmonary hyperinflation that occurs at rest is known as static hyperinflation3. During exercise in typical situations, to accommodate additional respiratory demands triggered by physical exertion, respiratory rate (RR) and tidal volume (VT) increase4. Patients with COPD have difficulty increasing VT and increase RR because they breathe at high lung volumes4. Increased RR reduces expiratory time by increasing air trapping4. This hyperinflation that occurs during exercise is known as dynamic hyperinflation (DH)5. DH is defined as an increase in functional residual capacity (FRC) or end-expiratory lung volume (EELV) above the resting value during periods of dynamic activity such as exercise6,7.

Several measures can be implemented to reduce DH in patients with COPD8,9,10. The use of inhaled bronchodilators8, as well as endurance physical training10,11, may improve exercise-induced dyspnea and hyperinflation. Ventilatory strategies, such as the pursed-lip breathing technique9,12 and noninvasive ventilation (NIV), have also proved effective in reducing DH in patients with COPD13,14,15,16,17,18,19. However, NIV requires the use of specific, expensive equipment that is not always available in healthcare facilities. Therefore, the easy-to-use, low-cost, expiratory positive airway pressure (EPAP) applied by a face mask aims to increase resistance in the expiratory phase, which would reduce physiological dead space, minute volume (VE), and RR, with consequent increase of the VT, being able to improve the length-tension relationship of the respiratory muscles, making them more efficient20,21. In addition, a reduction in inspiratory overload of hyperinflated lungs in patients with COPD would improve neuromuscular coupling22.

Several studies have evaluated the effects of EPAP during exercise on individuals with COPD17,18,23,24,25,26,27. However, controversy persists in the literature about the effects of EPAP on DH17,18,24,26,27 and exercise tolerance in individuals with COPD17,24,25,26,27. Therefore, we conducted this systematic review and meta-analysis of available studies that assessed the effects of EPAP during exercise on the onset of DH, exercise tolerance, and symptom intensity in subjects with COPD.

Methods

Protocol and eligibility criteria

We performed this systematic review according to the Updated guidance for trusted systematic reviews: a new edition of the Cochrane Handbook for Systematic Reviews of Interventions28, which included studies with subjects with mild to very severe COPD, according to the criteria set by the Global Initiative for COPD (GOLD)29, with clinical stability of the disease, presenting DH (10% IC reduction and/or a reduction greater than 150 ml, relative to the resting value)4 after stress test and who had used positive expiratory pressure as a strategy to minimize the onset of DH, increase exercise capacity, and/or reduce dyspnea, fatigue or desaturation. Studies that used continuous positive airway pressure (CPAP) or any other modality of NIV were excluded. Studies were included in English and Portuguese, with any follow-up or monitoring time and published in full version. Studies with lack of data and/or incomplete data, gray literature and multiple publications in which results were repeated were excluded, and only one of these studies was included.

Search strategies

The following electronic databases were searched: MEDLINE (Access by PubMed), Cochrane CENTRAL, PEDro, and EMBASE (up to dezember 10, 2019). Search terms included MeSH and COPD-related terms, positive expiratory pressure, dynamic hyperinflation, and Exercise Tests. The terms were adjusted to meet the requirements of each electronic database. We selected the list of included study references to identify additional RCTs. A full list of terms selected for searching the electronic databases is presented as supplemental information (Supplementary Table S1).

Study selection and data extraction

Two reviewers separately and independently selected the study titles and abstracts identified in the initial research. A standard screening checklist based on eligibility criteria was employed for each study. Studies that did not meet the eligibility criteria according to titles or abstracts were excluded. The full texts of the remaining studies were retrieved for a second independent review by the two reviewers. For studies lacking enough information to assess eligibility criteria, we contacted the authors by email for additional information.

We extracted the following data from the included studies: description of study participants, intervention in the experimental and control groups, and description and outcome measures. Two reviewers independently extracted data from eligible studies. Discrepancies were resolved through discussion, and a third author was consulted when a consensus was not reached. We contacted the authors via email for additional information when studies provided incomplete descriptions. Procedures for estimating missing data29 were performed when possible. If data were still insufficient after these processes, the results were included in the descriptive analysis only.

The primary outcome analyzed was the behavior of DH, assessed by measuring IC, with the application of EPAP at different pressure levels. As secondary outcomes, we assessed the impact of EPAP on exercise capacity, the sensation of dyspnea, respiratory rate, peripheral oxygen saturation, sense of effort in lower limbs (LL) and heart rate.

Assessment of bias risk

Two reviewers independently assessed the risk of bias in the included studies using the Cochrane Risk of Bias Tool28. The following items were assessed for each study: selection bias (random sequence generation and allocation masking), performance bias (blinding of patients and investigators), detection bias (blinding of outcome assessors), attrition bias (description of losses and exclusions) and reporting bias (selective reporting).

Summary of evidence: GRADE-criteria

We presented the overall assessment of the quality of evidence using the GRADE approach, as recommended by the Cochrane Manual for Systematic Reviews of Interventions28 (Table 3). For each specific outcome, the quality of evidence was based on five factors: (1) risk of bias, (2) inconsistency, (3) indirect evidence, (4) inaccuracy, and (5) other considerations (publication bias). Quality was reduced by one level for each of the missing factors. The GRADE approach resulted in four levels of evidence quality: high, moderate, low, and very low.

Data analysis

Estimates of combined effects were obtained by comparing the mean change from baseline to the end of the study for each group and were expressed as the weighted mean difference between groups. Studies in which it was not possible to calculate the standard deviation of the mean change were imputed as directed in the Handbook28. Calculations were performed using a random-effects method. A p-value < 0.05 was considered statistically significant. Statistical heterogeneity of treatment effects between studies was assessed by the Inconsistency test I2, where values above 25% and 50% were considered indicative of moderate and high heterogeneity, respectively30. All analyses were conducted using Review Manager (version 5.3). To explore heterogeneity between studies, we reviewed the meta-analyses by removing one article at a time to see if any individual study explained the heterogeneity. Still, sensitivity analysis was performed taking into account the different EPAP pressures used in the included studies (10 cmH2O and between 5 and 10cmH2O).

Results

Description of studies

The search strategy generated 2,227 records, 34 of which were considered potentially relevant and were retrieved for detailed analysis. Figure 1 shows the inclusion processes. Seven studies, with a total of 226 patients with moderate to very severe COPD, met the eligibility criteria for the systematic review. Only one study was randomized and controlled, and the rest used crossover design. Table 1 summarizes the characteristics of these studies. Five studies18,23,24,26,27 applied EPAP through a resistor face mask. Three studies used the 6MWT as a protocol17,25,26 for the evaluation of the effect of EPAP, and two used CEPT in cycle ergometer for lower limbs23,24. All studies used constant work rate tests, except one that used an incremental test23.

Figure 1
figure 1

Flow diagram of the studies included. EPAP: Expiratory positive airways pressure.

Table 1 Characteristics of included studies using positive expiratory pressure through a device.

Risk of bias

All studies included in the systematic review described follow-up losses and exclusions. Fifty percent presented adequate sequential generation, characterizing a low risk of bias for these items. In 37.5% of the studies, allocation confidentiality was reported, blinded outcome assessment and the intention-to-treat principle were used for statistical analysis, showing a moderate risk of bias. Only 25% included blinded patients (high risk of bias) (Table 2).

Table 2 Risk of bias of included studies.

Effects of interventions

Inspiratory capacity

Five of the included studies evaluated dynamic hyperinflation17,18,24,26,27. IC was compared during exercise of the lower limbs without EPAP and EPAP 10 cmH2O, with no significant effect on reducing the onset of DH (0.04 L; 95% CI − 0.1 to 0.2; I2 = 0%; p = 0.70). This same result occurred when the pressures of 5 and 10 cmH2O were evaluated (− 0.02 L; 95% CI − 0.1 to 0.1; I2 = 8%; p = 0.86) (Fig. 2), as well as in the pressure subgroup analysis presented in Table 1. Based on the GRADE approach, the quality of evidence for this outcome—both when considering all pressure levels and when only 10 cmH2O was applied—was very low (Table 4).

Figure 2
figure 2

Comparison of inspiratory capacity without EPAP versus EPAP from 5 to 10 cmH2O. a5 cmH2O; b10cmH2O.

Exercise capacity

Two studies included the assessment of exercise capacity to the tolerable limit in an endurance CEPT (Tlim)24,27, and three studies used the distance covered in the 6MWT17,25,26. Tlim was compared with the use of 5, 7.5, and 10 cmH2O EPAP in two studies, showing a significant reduction in exercise capacity when EPAP was applied at the three pressure levels used (− 214.8 seg; 95% CI − 400.2 to − 29.4; I2 = 0%; p = 0.02). The same behavior was observed when the subgroups with pressures ranging from 5 to 7.5 cm H2O or 7,5–10 cmH2O were analyzed (Table 3). The distance covered in the 6MWT was studied by applying the 5 cmH2O EPAP in one study and 10 cmH2O in the other two. No improvement in the 6MWT performance was observed with both pressure levels (1.7 m; 95% CI − 35.7 to 39.1; I2 = 38%; p = 0.93) (Fig. 3). Based on the GRADE approach, the quality of evidence for this outcome was considered very low (Table 4).

Table 3 Analysis of pressure subgroups.
Table 4 Quality of evidence using The GRADE approach.
Figure 3
figure 3

Comparison between exercise capacity with EPAP application at different pressure levels. a5 cmH2O; b10cmH2O.

Dyspnea sensation

Five studies included in the meta-analysis evaluated the sensation of dyspnea using the adapted (0–10) Borg Scale17,23,24,25,26. Two studies used only 5 cmH2O23,25, another two used 10 cmH2O17,26 and the last one used both24. There was no significant reduction in the sensation of dyspnea when evaluating studies that used 5 cmH2O (− 0.2; 95% CI − 2.8 to 2.2; I2 = 94%; p = 0.82), as well as when used only 10 cmH2O (0.4; 95% CI − 0.4 to 1.3; I2 = 0%; p = 0.34), or considering both pressure levels (0.04; 95% CI − 1.3 to 1.4; I2 = 86%; p = 0.95) (Fig. 4). Based on the GRADE approach, the quality of evidence for this outcome was considered very low (Table 3).

Figure 4
figure 4

Comparison of dyspnea sensation assessed by the Borg scale, with 5 and 10 cmH2O EPAP. a5 cmH2O; b10cmH2O.

Respiratory rate

Three studies included in the meta-analysis evaluated respiratory rate24,25,26. One study used only a 5 cmH2O pressure25, another used 10 cmH2O26, and the last one used both24. 5 cmH2O EPAP caused a significant reduction in RR compared to controls (− 2.3 bpm; 95% CI − 4.5 to 0.1; I2 = 0%; p = 0.04). However, there was no change in RR when a higher pressure level of 10 cmH2O was applied (− 0.1 bpm; 95% CI − 2.7 to 2.4; I2 = 0%; p = 0.90), or when the analysis was performed without stratification by EPAP pressure (− 1.4 bpm; 95% CI − 3.0 to 0.2; I2 = 0%; p = 0.10) (Fig. 5). Based on the GRADE approach, the quality of evidence for this outcome, when considering only the 5 cmH2O pressure, was low, and for 10 cmH2O, the evidence was considered very low (Table 4).

Figure 5
figure 5

Comparison between respiratory rate with EPAP application at different pressure levels. a5 cmH2O; b10cmH2O.

Oxyhemoglobin saturation by pulse oximetry

Five studies evaluated peripheral oxygen saturation17,24,25,26,27, two used 5 cmH2O EPAP24,25, one used 7.5 cmH2O27, and three used 10 cmH2O17,24,26. The use of 5 cmH2O EPAP (0.5%; 95% CI − 0.7 to 1.9; I2 = 31%; p = 0.40), 10 cmH2O (0.7%; 95% CI − 0.9 to 2.4; I2 = 12%; p = 0.39) or 5–10 cmH2O (0.5%; 95% CI − 0.4 to 1.5; I2 = 0%; p = 0.25) did not cause significant changes in SpO2 during exercise (Fig. 6). The same behavior was observed when the subgroups with pressures ranging from 5 to 7.5 cm H2O or 7,5 to 10 cmH2O were analyzed (Table 3). Based on the GRADE approach, the quality of evidence for SpO2, both when considering all pressures, and when only 5–10 cmH2O, was very low (Table 4).

Figure 6
figure 6

Comparison of oxyhemoglobin saturation by pulse oximetry with 5–10 cmH2O EPAP. a5 cmH2O; b10cmH2O.

Leg discomfort

Two studies evaluated lower limb discomfort with 10 cmH2O EPAP by the adapted Borg Scale24,26. The use of EPAP did not modify lower limb discomfort during exercise (0.1; 95% CI − 0.3 to 0.7; I2 = 0%; p = 0.53) (Fig. 7). Based on the GRADE approach, the quality of evidence for this outcome was considered very low (Table 3).

Figure 7
figure 7

Comparison of leg discomfort assessed using the Borg scale, with 10 cmH2O EPAP. b10cmH2O.

Discussion

This systematic review that studied the use of EPAP during LL exercises in individuals with COPD showed that this device did not modify DH and reduced exercise time (Tlim) in the constant load test, while the distance covered in the 6MWT remained unchanged. EPAP did not change symptoms and desaturation at exercise end. At the end of the exercise the RR value presented an increase of around 3.8 ± 6.9 bpm with a 5 cmH2O EPAP, while controls had a greater increase of nearly 6.6 ± 5.5 bpm without the use of EPAP.

The effect of EPAP on DH in COPD was studied previously by other authors. Monteiro et al.18. evaluated the effect of EPAP, applied through a face mask, with an approximate pressure level of 8 cmH2O, on the onset of DH during submaximal exercise in patients with moderate to very severe COPD. This study demonstrated that the application of this pressure modality was associated with exercise-induced attenuation of DH (i.e., a lower decrease in inspiratory capacity (IC) immediately after exercise). Similarly, Wibmer et al.17. also observed a reduction in DH after performing the six-minute walk test (6MWT) with 10 cmH2O nasal EPAP in patients with moderate to severe COPD. Another study showed that a 5 cmH2O EPAP during exercise reduced ventilation and physiological dead space, but the degree of baseline hyperinflation and DH were not evaluated23.

We hypothesized that EPAP would reduced exercise-induced DH in individuals with COPD. This effect was not demonstrated by this meta-analysis, in which DH, measured by serial IC measurements, did not change significantly by applying EPAP ranging from 5 to 20 cmH2O. One of the mechanisms that could reduce DH would be through the reduction of RR, which was observed in the meta-analysis. Reduction of RR with 5 cmH2O EPAP during LL exercise may be due to less airway collapse, which may lead to prolonged expiratory time31,32 and thereby reduce lung volumes23, which could theoretically decrease DH33. However, no reduction in DH was observed.

Another reason to explain the ineffectiveness of EPAP in reducing DH in some studies is the use of bronchodilators by patients, because it is possible that offering them to all participants before exercise may have minimized the effects of positive pressure on IC. In the sample included in this meta-analysis, five studies mentioned long-term use of bronchodilators by patients17,18,23,24,25, and three of these were associated with use of short-term bronchodilators24,25. Thus, findings in patients under this condition could not be generalized to those without recent bronchodilator use, since the use of bronchodilators immediately before an exercise test interferes with the degree of DH developed during the examination24.

Regarding the pressure level used in EPAP in the articles analyzed, these ranged from 5–10 cmH2O. Thus, only the study conducted by Monteiro et al.18. allowed the adjusted pressure level to be the one in which the patient reported greater comfort, and this level was on average 8 ± 1.5 cmH2O. It is also important to mention that in the study conducted by Wibmer et al.17. the minimum pressure level was 10 cmH2O, however it could reach up to 20 cmH2O, because the device used for positive pressure generation was based on a silicone nasal mask with an adjustable orifice resistor, which was generally capable of a flow-dependent expiratory pressure. The authors pointed out that all subjects received positive pressure with the device's expiratory resistance set to the largest available opening (5.0 mm). Thus, pressure generation would be close to 10 cmH2O, but as pressure level generation was flow-dependent, this value may have been exceeded. However, in our meta-analysis, when this study was omitted to assess possible individual study influences on the outcomes, heterogeneity and weighted mean difference remained unchanged.

The use of EPAP reduced exercise capacity (Tlim) measured by the constant load test. It is possible that there has been a significant reduction in VO2 and systolic volume assessed through the oxygen pulse, as previously shown24,34, indicating compromised hemodynamic response. This finding is related to the decrease in venous return caused by excessive recruitment of expiratory muscles35, which may lead to reduced ventilation/perfusion ratio and cardiac output36, where such associated factors may not have allowed that the effect of EPAP on this outcome was demonstrated.

Another factor that may also have influenced exercise capacity is the increase in sympathetic vasomotor outflow during the test, which can be observed through increased expiratory resistance37. When added to central hemodynamics and ventilatory restriction, all these mechanisms may contribute to an impaired exercise capacity. In addition to the physiological factors mentioned, the type of test used to assess this outcome could have influenced the positive or negative results regarding the application of EPAP. Thus, as performed, in relation to the pressure level, when a study included in the meta-analysis presented a different exercise capacity test, it was omitted in order to observe possible individual influences on the results regarding the heterogeneity and the weighted mean difference; however, results remained unchanged.

Our study has several methodological strengths. These are comprehensive and systematic bibliographic research, the collaboration of a multidisciplinary team of health researchers, and methodologies that used explicit and reproducible eligibility criteria. In addition, we performed a meta-analysis to quantitatively express the results obtained and assess the quality of evidence for each outcome analyzed.

We found that many of the studies were methodologically limited by a high risk of bias. Only one study clearly presented blinding (patients and evaluators), and allocation concealment confidentiality25. However, all studies17,18,23,24,25,26,27 described the losses and exclusions that occurred during the follow-up period. Thus, sensitivity analyzes were partially impaired by the methodological quality presented by the included studies and the small number of studies and participants. Moreover, the included studies do not have enough statistical power, because even performing the meta-analysis, the 95% confidence intervals remained quite wide. Moreover, according to the GRADE approach, most results presented very low quality of evidence. This indicates that any effect estimate is very inaccurate, and it is very likely that further research will have a more important impact on our confidence to estimate the effect, suggesting that further studies with a larger number of subjects and stricter methodological criteria should be performed.

Due to the statistical heterogeneity found in the meta-analysis, we performed a detailed exploration of sources of heterogeneity between studies, including a detailed description of sensitivity analysis and subgroup analysis. The steps used to analyze the moderate and high heterogeneities of the studies were (1) perform the meta-analysis removing one article at a time to check if any individual study explained the heterogeneity, and (2) perform the sensitivity analyses based on the pressure level used and the type of exercise test which the patients underwent. Despite this, in some results, both the pressure level and the type of exercise test used do not seem to influence the meta-analysis results.

Although the main results found in our study, some limitations need to be considered. Since five from the seven included studies were randomized crossover trials17,18,23,24,26,27, the first intervention may have generated an impact in the following in case the washout period was insufficient. From the six studies selected, two did not mentioned the washout period23,27, two reported a 48 h period and its patients underwent the CPET18,24, another study reported a washout period of 2–24 h17 and the last one reported only 1 h 26, being that in the last two studies the patients underwent the 6MWT. Another significant limitation is the sample selection, since five from the seven selected studies included individuals with severe to moderate COPD17,18,23,24,25 and two recruited individuals with severe to highly severe COPD26,27. This data is relevant since patients with a higher severity of the disease may report an increased resting hyperinflation and/or DH during exercise, therefore, could have a greater benefit from the EPAP application. On the other hand, if the level of EPAP provided is too high for a patient, it may worsen the hyperinflation, have a negative effect on pulmonary mechanics, increase work of breathing and reduce exercise capacity.

Our study demonstrates that EPAP at different pressure levels during LL exercise in patients with COPD did not change DH and the distance covered in the 6MWT, but worsened performance in constant load exercise. The use of EPAP during exercise did not change symptom intensity, desaturation, or heart rate. There was a significant reduction in respiratory rate with the use of EPAP. Due to the low methodological rigor of the included articles and the small sample size of the studies, further randomized clinical trials should be performed to corroborate these findings.