Introduction

Lung cancer remains the leading cause of cancer mortality worldwide1, the majority of lung cancer is non-small cell lung cancer (NSCLC)2. For patients with unresectable NSCLC, radiation therapy (RT) treatment options include concurrent chemoradiation (CRT), sequential CRT, curative RT and palliative RT.

Although the treatment regimen that provides the highest cure rate for each disease stage is well established, population studies have shown that treatment in NSCLC is consistent with guidelines in only 44–52% of cases3,4,5, and radiotherapy remains underutilised across the world6. While many factors influence the under-utilisation of radiotherapy, an important aspect is clinician concern regarding treatment-related toxicity, where treatments associated with better survival outcomes have increased toxicity. Comorbidity potentially influencing treatment is prevalent in 72%-81% of lung cancer patients7,8,9. This has been associated with reduced likelihood of patients receiving radiotherapy7.

Numerous studies now reported on survival prediction models for NSCLC, two from the MAASTRO group10,11. These both show that even with curative radiotherapy (± chemotherapy), there are different prognostic groups of patients, some who do poorly despite radical RT and some who do well. If clinicians are to use these models then the patient also needs to be informed of toxicity predictions for shared decision making. Some ‘poor risk’ patients may choose to accept higher toxicity rates with curative RT despite small survival gains, and others may not. However, the available literature can be difficult to interpret when quantifying the rate of toxicity between different treatment regimes. Due to the variable toxicity types and rates that is reported in individual trials, better estimates of toxicities would be helpful in guiding clinical management.

The aim of this systematic review and meta-analysis is to evaluate and statistically combine toxicity rates of curative and palliative RT (excluding stereotactic body radiation therapy) with or without chemotherapy for patients with unresectable NSCLC. This information increases the precision of the quantitative estimates of the toxicity rates compared to individual trials.

Methods

A systematic search of electronic databases (MEDLINE, PubMed, EMBASE, and the Cochrane Central Register) was performed using the following terms: non-small cell lung cancer, radiation therapy, radiotherapy, randomised controlled trial, controlled clinical trial, controlled trials, systematic review, and meta-analysis. We included recent studies published between January 2000 and June 2019. Searches were limited to human studies published in English. When multiple studies of the same clinical trial were encountered, the updated results were included. The PRISMA guidelines (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) were used to assist in writing this review12.

References identified by the search strategy were screened independently by two investigators (M.O. and B.L.) to review the trials for eligibility for inclusion and the list of trials eligible for inclusion was agreed.

Inclusion and exclusion criteria

Studies that met the following criteria were included: published randomised trial with greater than 50 patients with unresectable NSCLC undergoing curative and/or palliative RT. Curative RT was defined as a minimum dose of 50 Gy, or its radiobiological equivalent, with or without chemotherapy13. Palliative RT was defined as a dose of less than 50 Gy. Unresectable disease could be medically or surgically inoperable.

We excluded trials with small cell lung cancer or recurrent lung cancer. Patients treated with prior high dose RT in region of lung cancer, stereotactic body radiation therapy (SBRT), protons, carbon-ions, post-operative RT or palliative CRT were also excluded.

Data extraction

The details of included trials were recorded independently by two authors (M.O and B.L) via a data collection template (Appendix 1). Any discrepancy was resolved by consensus with third party (J.L.). Patient and trial characteristics, including disease stage, median age, study type, follow-up and toxicity criteria used were extracted along with summary information on treatment characteristics (treatment regime, chemotherapy type and timing, dose and fractionation). Treatment-related toxicity for each RT regimen was extracted, including the incidence and grade of oesophagitis, pneumonitis, cardiac events, pulmonary fibrosis, radiation myelopathy, neutropenia and/or treatment-related death (TRD).

Statistical analysis

The pooled risk of toxicities by any grade, grade ≥ 3, and treatment related deaths were expressed as the total number of cases for each toxicity outcome divided by the total number of patients treated with the same type of treatment. Treatment regimens were categorised into palliative RT alone, curative RT alone, sequential CRT and concurrent CRT. We performed indirect comparisons to estimate the risk ratio for the comparison between palliative versus curative RT and sequential versus concurrent CRT. The Mantel–Haenszel fixed-effect method was used to obtain the pooled risk ratio and corresponding confidence interval. We used the fixed-effect method for all comparisons for consistency. Statistical heterogeneity was assessed by calculating I2. Cochrane Review Manager version 5.3 (Cochrane Collaboration, Copenhagen, Denmark) was used for the analyses.

Quality assessment

The risk of bias for each trial was assessed using the criteria outlined in the Cochrane Handbook for Systematic Reviews of Interventions14. These include random sequence generation, allocation concealment, incomplete outcome data, selective reporting and other biases (such as method of assessing symptoms).

Results

Eligible studies

We identified 49 eligible trials15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64 with a total of 10,388 patients, of which 8609 were evaluable for toxicity (Fig. 1). The overall trial characteristics are shown in Table 1. There was variability in the reporting of symptoms, with various versions of 5 different toxicity grading criteria used in 39 of the included trials. 8 trials included stage IV patients accounting for 1835 patients. 5 of these were palliative trials and the remaining 3 trials only had a small proportion (23 patients) of stage IV disease. Treatment characteristics of included trials are summarised in Table 2 demonstrating the heterogeneity with respect to the study design, toxicity scoring criteria, treatment arms, RT dose fractionation and chemotherapy regimen. There was a wide range of RT dose fractionation used, from 10 Gy in 1 fraction for palliative RT, up to 74 Gy in 37 fractions in concurrent CRT. Most chemotherapy regimens were platinum-based. 2 studies assessed elderly patients17,18,60.

Figure 1
figure 1

PRISMA Flow diagram12 with details of the number of studies identified, screened, assessed and included in the final review.

Table 1 Patient and study characteristics of included trials.
Table 2 Treatment characteristics of included trials.

Treatment-related death

The overall rate of TRD was low on indirect comparisons, highest in concurrent CRT (3.1%), followed by sequential CRT (2.3%), curative radiation alone (2.4%) and palliative radiation (0%). In the 6 trials20,22,27,33,51,53 that directly compared concurrent with sequential CRT, TRD from concurrent was higher than sequential CRT but the difference was not statistically significant (5.1% vs 2.7%, p = 0.05) (Fig. 2). In the one trial30 that compared TRD in sequential CRT with curative RT alone, the rate of TRD was higher in sequential CRT but the absolute difference was small (1.7% vs 0.9%), insufficient for meta-analysis. No trial directly compared palliative with curative RT for TRD.

Figure 2
figure 2

Forest plot showing toxicity risk ratio (RR) for treatment-related death; comparison between sequential versus concurrent chemoradiation, generated with Cochrane Review Manager version 5.3

Oesophagitis

Grade ≥ 3 oesophagitis from concurrent CRT was statistically significantly higher than sequential CRT in 9 trials (22.2% vs 6.4%, p < 0.0001) (Fig. 3A). Any grade oesophagitis from concurrent CRT was also statistically significantly higher than sequential CRT in 3 trials (79.0% vs 70.4%, p = 0.04) (Fig. 3B). 2 trials48,59 compared any grade oesophagitis between curative RT and palliative RT, this was higher in curative but the difference was not statistically significant (35.4% vs 26.6%, p = 0.06) (Fig. 3C). Trials were not sufficient for meta-analysis in other comparison groups in assessing grade ≥ 3 or any grade oesophagitis.

Figure 3
figure 3

Forest plot showing toxicity risk ratio (RR) for: (A) grade ≥ 3 oesophagitis, comparison between sequential versus concurrent chemoradiation; (B) any grade oesophagitis, comparison between sequential versus concurrent chemoradiation; (C) any grade oesophagitis, comparison between palliative versus curative radiation therapy, generated with Cochrane Review Manager version 5.3

Pneumonitis

In the 7 trials that directly compared concurrent with sequential CRT, Grade ≥ 3 pneumonitis from concurrent was higher than sequential CRT, but not statistically significant (11.1% vs 8.7%, p = 0.26) (Fig. 4A). 2 trials38,53 directly compared any grade pneumonitis, demonstrating the rate from concurrent CRT was not statistically significantly higher than sequential CRT (10.0% vs 5.7%, p = 0.16) (Fig. 4B). Trials were not sufficient for meta-analysis in other comparison groups in assessing grade ≥ 3 or any grade pneumonitis.

Figure 4
figure 4

Forest plot showing toxicity risk ratio (RR), comparison between sequential versus concurrent chemoradiation for: (A) grade ≥ 3 pneumonitis; (B) any grade pneumonitis; (C) grade ≥ 3 neutropenia; (D) grade ≥ 3 cardiac event, generated with Cochrane Review Manager version 5.3

Neutropenia

Neutropenia was reported in different time intervals following chemotherapy or not specified. Selective reporting of febrile neutropenia was also identified. 3 trials directly compared Grade ≥ 3 neutropenia between concurrent and sequential CRT. Rates from concurrent was higher than sequential CRT, but not statistically significant (58.2% vs 55.4%, p = 0.56) (Fig. 4C). Trials were not sufficient for meta-analysis in assessing any grade neutropenia.

Cardiac adverse events

4 trials directly compared grade ≥ 3 cardiac events between concurrent and sequential CRT. The rates from concurrent was higher than sequential CRT, but not statistically significant (4.3% vs 1.8%, p = 0.08) (Fig. 4D). Trials were not sufficient for meta-analysis in other comparison groups for grade ≥ 3 or any grade cardiac events.

Pulmonary fibrosis and myelopathy

Pulmonary fibrosis and radiation myelopathy were poorly reported in the studies. Only 7 trials reported pulmonary fibrosis and 9 trials reported myelopathy across all treatment groups; meta-analysis to compare between groups was not feasible. The rate of pulmonary fibrosis (any grade) was higher in the palliative RT and curative RT arms than the sequential CRT and concurrent CRT. This finding is strongly influenced by a single study by Nestle et al. which reported 100% rate of pulmonary fibrosis based on imaging rather than clinical symptoms.

Toxicity stratified by stage

Trials with only stage III NSCLC comparing sequential versus concurrent CRT were analysed (see Fig. 5). The rate of grade ≥ 3 oesophagitis was statistically higher for concurrent CRT (16.6% vs 7.4%, p < 0.0001), whilst the difference in rates of treatment related death, grade ≥ 3 pneumonitis and grade ≥ 3 cardiac events were not statistically significant. Trials were not sufficient for analysis stratified by stage for stages I, II or IV disease.

Figure 5
figure 5

Forest plot showing toxicity risk ratio (RR) for Stage III only studies, comparison between sequential versus concurrent chemoradiation: (A) treatment-related deaths; (B) grade ≥ 3 oesophagitis; (C) grade ≥ 3 pneumonitis; (D) grade ≥ 3 cardiac event, generated with Cochrane Review Manager version 5.3

Pooled toxicity rates

Overall, the pooled grade ≥ 3 and any grade toxicities rates were lower with sequential compared with concurrent CRT. On pooled comparisons, Grade ≥ 3 (RR 0.75; CI 0.65–0.87) and any grade neutropenia (RR 0.55; CI 0.47–0.64) were significantly less with sequential compared with concurrent CRT. Grade ≥ 3 oesophagitis was also significantly lower with sequential CRT (RR 0.42; CI 0.32–0.54) but not any grade oesophagitis. Any grade cardiac events (RR 0.48; CI 0.23–0.98) and pulmonary fibrosis (RR 0.36; CI 0.20–0.63) were significantly less with sequential compared with concurrent CRT (see Tables 3 and 4).

Table 3 Summary table of the pooled grade ≥ 3 toxicity rates.
Table 4 Summary table of the pooled any grade toxicity rates.

Any grade toxicities were also lower with palliative compared with curative radiation alone. On pooled comparisons, any grade oesophagitis (RR 0.40; CI 0.32–0.50) and pneumonitis (RR 0.04; CI 0.02–0.09) were significantly less with palliative compared with curative RT alone (see Tables 3 and 4).

The range of reported grade ≥ 3 oesophagitis was 0 to 41.4% for concurrent CRT and 0 to 20.4% for sequential CRT, whilst any grade oesophagitis ranged between 46.4% and 100% for concurrent CRT and 36.4% to 100% in sequential CRT.

Risk of bias

Reporting bias were identified with incomplete data and selective reporting of toxicities in most studies, resulting in an overall high risk of bias. Funnel plots were generated to visually assess for publication bias. Symmetrical funnel plots were obtained for comparison groups (> 5 studies) between sequential and concurrent CRT in grade 3 oesophagitis, pneumonitis and treatment-related deaths.

Discussion

In lung cancer clinical decision making, the consideration of toxicity is essential. As expected, patients receiving palliative RT had lowest toxicity, followed by curative RT alone, sequential and highest with concurrent chemoradiation. The benefit of this review is to provide better estimates of each toxicity effect compared to individual trials.

Acute oesophagitis is one of the main morbidities from lung irradiation. The large differences between individual trials makes it difficult for clinicians to estimate the toxicity in the process of informed consent. The grade ≥ 3 rate with curative RT without chemotherapy is low (0.5%). However, only 4 trials were included, 2 of which included stage I patients only17,19,25,49. Although this toxicity is significantly higher with concurrent CRT, it should not be used alone as a factor to preclude concurrent treatment. Oesophagitis can be managed with nutritional support and admission and rarely leads to late stenosis. In addition, IMRT have reduced the incidence of this65,66. This difference in oesophagitis rates should be considered as oesophagitis may impact on survival67.

Pneumonitis occurs sub-acutely and is the main toxicity of concern as it can result in death. Although the risk of any grade pneumonitis is high for all curative radiotherapy, the risk of Grade 3 + pneumonitis is < 10%. In addition, we found no significant difference between concurrent versus sequential CRT. This suggests that decisions regarding the sequencing of treatment should not be based on the anticipated risk of pneumonitis. However, the increasing use of adjuvant or palliative immunotherapy when combined with prior radiotherapy may potentially increase future pneumonitis risk.

Cardiac toxicity encompasses a range of disorders. Nearly all studies reported cardiac toxicity as a general outcome “cardiac” rather than specifying individual events. The pathophysiology and dose resulting in an event is likely to differ. The risk of grade ≥ 3 toxicity has been correlated with pre-existing cardiac disease and mean heart dose68. In breast cancer, Darby et al. found the rates of major coronary events increased linearly with the mean heart dose by 7.4% per Gray69. Moreover, data from RTOG 0617 showed heart dose is an independent factor for overall survival70. However, a systematic review which includes 3 studies from RTOG 0617 found that heart dose-volume parameters were not consistently associated with survival or cardiac toxicity71. Although reduction of heart dose is ideal, any de-escalation of therapy should be carefully weighed against the resulting inferior cure rates71,72,73,74.

Toxicities for the elderly population are not well established due to the lack of and under-representation in randomised trials. The EORTC and SIOG groups recommended chemotherapy to be considered only in selected fit elderly patients, as the added toxicity may outweigh survival benefit75. In this systematic review, there were only two included studies that specified elderly toxicity rates, reflecting the need for randomised trials in this group to aid determine best suitable treatment.

Only one trial included adjuvant immunotherapy and no palliative immunotherapy was used. The studies included treatment with various radiation technique, dose fractionation, including escalated therapy (radiation dose24,61 or systemic therapy). Due to changes in radiation technique, older studies (prior to 2000) were not included in this review. Advanced radiation technique such as 3-dimentional compared with 2-dimensional palliative RT to improve conformality can reduce toxicities76. Secondary analysis from the RTOG 0617 also confirms that IMRT was associated with lower rates of severe pneumonitis and cardiac doses in locally advanced NSCLC77. We did not review toxicities relating to SBRT or the impact of non-chemotherapy systematic therapy.

There are several limitations which are inherent to systematic reviews of randomised trials78. Whilst the selected good performance status patients may limit the generalisability, the rates reported in this review may be higher due to escalation of treatment in the experimental arms. On the other hand, real-world patients may also have pre-existing comorbidities and other patient factors which could increase toxicity. This review was unable to analyse toxicity rates based on dose-volume parameters due to insufficient data published in the trials included. Moreover, the pooled rates reported are averages of the toxicity from treatment in different stages. This likely results in an overestimation of risk for those with stage I compared to III disease79. The incidence of toxicities reported are crude estimates between the number of patients with toxicities and the total number of patients treated. Actuarial estimates provides a more accurate determination toxicities prevalence80. The findings from this review should be interpreted with some caution.

We included randomised trials as protocols with prospective data generally provides better-quality toxicity data. In concordance with Sivendran et al. on adverse event reporting in cancer clinical trial publications, we identified selectivity and heterogeneity with reporting toxicities in trials81. The quality of studies examined ranged from low to high, contributed by reporting bias. There was variation in timing of reported toxicities, different toxicity grading criteria used and limited studies on quality of life. This highlights the need for trials to report reliable toxicity data, ideally under standardised criteria and in conjunction with the Consolidated Standards of Reporting Trials (CONSORT) recommendations82.

To the best of our knowledge this is the only available review of toxicity data in recent trials that compares and provides estimates of palliative radiotherapy, curative radiotherapy, sequential and concurrent chemoradiotherapy regimens. The only statistically significant difference between treatment regimens was the rate of oesophagitis with concurrent versus sequential CRT. This information is clinically useful and should be considered by clinicians and patients when weighing up the established survival benefits with the toxicity of the different treatment options.