Main

Pancreatic cancer remains one of the most fatal and least understood human malignancies and continues to be a major unsolved health problem (Melisi et al, 2014). It has a five-year relative survival rate at 7.7% (SEER Stat Fact Sheets), and is projected to become the second leading cause of cancer-related death by the year 2030 in western countries (Rahib et al, 2014). More than 338 000 people per year are diagnosed with pancreatic cancer worldwide, and approximately the same number die of this disease (Ferlay et al, 2013). The poor prognosis associated with pancreatic cancer can be attributed to the disease’s early metastatic behaviour during progression, its aggressive course, and, in particular, to the limited efficacy of currently approved classic chemotherapeutic treatments (Tamburrino et al, 2013). Current treatment guidelines – most notably those from the National Comprehensive Cancer Network (version 2, 2016) and the European Society for Medical Oncology (ESMO, 2015) – recommend the following first-line treatment options for advanced pancreatic cancer depending on patients’ performance status (measured by Eastern Cooperative Oncology Group (ECOG) scale): gemcitabine and nab-paclitaxel combination therapy or fluorouracil–leucovorin–irinotecan–oxaliplatin (FOLFIRINOX) regimen (ECOG 0 or 1), and these two regimens are preferred chemotherapy as recommended in National Comprehensive Cancer Network and other acceptable regimens include gemcitabine+erlotinib, gemcitabine+capecitabine, and so on; gemcitabine monotherapy (ECOG 2); or best supportive care (ECOG 3–4 or comorbidities). However, the majority of patients progress while on first-line therapy and have limited proven options in the second-line setting. National Comprehensive Cancer Network guideline(s) have suggested that fluoropyrimidine-based chemotherapy, such as 5-fluorouracil and leucovorin (5-FU/LV), is an acceptable second-line option among patients previously treated with gemcitabine-based therapy (Tempero et al, 2014; Ducreux et al, 2015; Oettle et al, 2015).

Nanoliposomal irinotecan (nal-IRI; US trade name Onivyde) was recently approved by the US Food and Drug Administration and the European Medicines Agency review for the treatment of metastatic pancreatic adenocarcinoma (mPAC) in combination with 5-FU/LV in patients previously treated with gemcitabine-based therapy. The US Food and Drug Administration approved nal-IRI+5-FU/LV based on results from the NAPOLI-1 (NAnoliPOsomaLIrinotecan) clinical trial (NLM identifier: NCT01494506). NAPOLI-1 was a global, multicenter, open-label, Phase 3 trial that included adult metastatic pancreatic cancer patients whose disease had progressed following previous gemcitabine-based therapy (Wang-Gillam et al, 2016). The NAPOLI-1 trial demonstrated that the nal-IRI+5-FU/LV significantly improved patients’ median overall survival (OS) (6.1 vs 4.2 months, HR=0.67, P=0.012) and median progression-free survival (3.1 vs 1.5 months, HR=0.56, P=0.0001) compared with 5-FU/LV therapy alone. In its 2015 updated guidelines, ESMO indicated nal-IRI+5-FU+folinic acid as the best second-line treatment option based on the currently available evidence for this patient population (Ducreux et al, 2015).

The present analysis aimed to compare the quality-adjusted time without symptoms of disease progression or toxicity (Q-TWiST) of nal-IRI+5-FU/LV treatment over 5-FU/LV treatment alone using the NAPOLI-1 clinical trial data. The Q-TWiST method (Chen et al, 2015) integrates the quality and quantity of survival by partitioning survival time into three clinically relevant periods, which are assigned different quality-of-life weights (i.e., ‘utilities’), including the periods in which patients (1) experienced toxicity (TOX) due to treatment, (2) did not experience any TOX and had not yet progressed (i.e., pre-progression time without adverse events (AEs)), and (3) experienced progression after disease recurrence. This approach facilitates comparison between treatments by penalising treatments with increased toxicities or shorter times to disease progression and rewarding those with lower TOX and longer progression-free and OS times (Cole and Gelber, 1995).

Materials and methods

Study cohorts

The study cohorts for this analysis were from the NAPOLI-1 clinical trial (Chen et al, 2015; Wang-Gillam et al, 2016). Patients were randomised into three treatment arms: nal-IRI, 5FU/LV, and nal-IRI+5-FU/LV (1 : 1 : 1 ratio). The primary end point was OS; secondary end points included progression-free survival, time to treatment failure, and tumour reponse. The study’s full inclusion and exclusion criteria and findings can be found in the NAPOLI-1 primary publication (Wang-Gillam et al, 2016).

The primary Q-TWiST analysis was performed using the intent to treat population, and was supplemented with analyses in the per-protocol population (i.e., patients who received 80% of the protocol-defined treatment during the first 6 weeks of treatment) (Chen et al, 2015) and analyses in nine pre-specified subgroups: race (Asian), baseline Karnofsky performance status scores (<90, 90–100), time since treatment (<1.3 months, >1.3 months), cancer stage at diagnosis (stage IV, other), and tumour location (head, other). The Q-TWiST analysis was performed specifically on the combination treatment arm (nal-IRI+5-FU/LV) vs 5-FU/LV. The current study focused on the treatment arms amended after safety data on the combination became available, which is consistent with the primary analysis (Chen et al, 2015).

Statistical analysis

The Q-TWiST method combines treatment benefits and risks into a single measure by partitioning survival time into three health states and subsequently assigning quality-of-life weights to the survival time in each state (Gelber et al, 1993). These three health states include time with grade 3 TOX (i.e., all-cause AEs), disease progression (REL), and time without disease progression symptoms or grade 3 toxicity (TWiST). The restricted mean duration of each health state was calculated as the area under its Kaplan–Meier curve through 12 months of follow-up (Kaplan and Meier, 1958; Fairclough, 2010).

TOX time was defined as the period during which a patient experienced AEs between randomisation and progression. It was calculated as the total number of days between randomisation and progression in which each patient experienced grade 3 all-cause adverse events (AEs; defined using the National Cancer Institute Common Terminology Criteria for Adverse Events). Any day with multiple AEs was only counted once, and the total number of TOX days were grouped together during a patient’s treatment, regardless of when TOX days occurred (but only counting AEs prior to relapse). REL time was defined as the time after disease progression (i.e., the period when patients experience disease relapse).

The Q-TWiST value is the linear combination of the health state durations adjusted by the respective utilities (U):

Within this analysis, two assumptions about TOX time were made: (1) the utility associated with TOX was the same regardless of AE type or grade; (2) AE duration was truncated on the date the patient progressed if the AE continued past that date.

In the base case, the utility of TOX and REL were both assigned as 0.5 and TWiST was assigned as 1.0 consistent with prior Q-TWiST analyses in various oncology disease areas (Solem et al, 2014). In a threshold analysis, the difference in Q-TWiST between treatments was calculated over all possible combinations of U(TOX) and U(REL), ranging from 0.0 to 1.0. To understand how the utility weight of TWiST affects the results, alternative utility weights were used in a sensitivity analysis: U(TWiST)=0.80, U(TOX)=0.65, and U(REL)=0.75. These alternative utility weights of TWiST and REL were obtained from a clinical trial, which prospectively collected utility data of patients with mPAC (Romanus et al, 2012) who received gemcitabine monotherapy or gemcitabine-based combination therapy. The utility weight of TOX was calculated based on a series of study results reporting the utility decrement of AEs (Lloyd et al, 2006; Nafees et al, 2008; Swinburn et al, 2010; Tolley et al, 2013; Goldstein et al, 2014). Additionally, since afebrile neutropenia may not be clinically symptomatic, a sensitivity analysis was performed to assess the Q-TWiST difference when removing the AE of neutropenia from TOX time. Given that certain grade 2 toxicities also impact patients’ quality of life, such as fatigue, a sensitivity analysis was performed by including grade 2 toxicities in the TOX time calculation.

Consistent with Revicki et al (2006), the relative gain in Q-TWiST in nal-IRI+5-FU/LV over 5-FU/LV was calculated as the difference in Q-TWiST divided by the OS time of 5-FU/LV group (i.e., control group). It was reported for the base case Q-TWiST analysis and threshold analysis. The relative gain in Q-TWiST was used to define clinical important difference (10%) and clearly clinically important difference (15%) of Q-TWiST in Revicki et al.

Statistical testing was limited to a comparison of the overlap of 95% confidence intervals (CIs), which were calculated using 1000 sample nonparametric bootstraps of the data (Fairclough, 2010). Mean durations of each health state and quality-adjusted survival time were compared and differences calculated between treatment groups. If the 95% CI did not overlap between treatment groups (or the 95% CI around the difference did not overlap 0), the difference was considered statistically significant. SAS version 9.4 (SAS Institute, Inc., Cary, NC, USA) was used to perform the statistical analyses.

Results

Patient sample

A total of 236 patients were randomised to nal-IRI+5-FU/LV treatment (n=117) or 5-FU/LV treatment (n=119). Patients randomised to nal-IRI+5-FU/LV had a mean (s.d.) age of 63.2 (9.06) years and were 59% male and 29.1% Asian, while 5-FU/LV patients were aged 61.0 (9.46) years, 56.3% male, and 30.3% Asian (Table 1). There were no significant differences in baseline characteristics between treatment groups.

Table 1 Study populations and demographics

Duration of time spent in each health state

The mean duration of time spent in TWiST, TOX, and REL was significantly longer among patients receiving nal-IRI+5-FU/LV vs 5FU/LV alone (Table 2). In both cohorts, the mean TWiST duration was significantly higher than the mean TOX duration, indicating a greater amount of time without significant symptoms prior to progression compared to time with AEs.

Table 2 Mean duration of TOX, TWiST, and REL between treatment groups at 12 months for nal-IRI+5-FU/LV vs 5-FU/LV in the ITT cohort

Q-TWiST (NAL-IRI+5-FU/LV vs 5-FU/LV)

In the base case, when the utility weights for the TOX and REL health states were set to 0.5 in the intent to treat population, there was a statistically significant 1.3-month gain (95% CI, 0.4–2.1 months) in Q-TWiST favouring nal-IRI+5-FU/LV (5.1 months (95% CI, 4.5–5.8 months)) over 5-FU/LV alone (3.9 months (95% CI, 3.3–4.5 months)). This translated to a relative improvement of 23.8% at 12-month follow-up. In the per-protocol population, the results remained in favour of nal-IRI+5-FU/LV over 5-FU/LV alone (difference in Q-TWiST: 1.8 months (95% CI, 0.7–3.0 months)) (Table 3).

Table 3 Quality-adjusted time without symptoms or toxicities threshold analysis for nal-IRI+5-FU/LV vs 5-FU/LV in the ITT cohort

In the threshold analysis, the absolute gain in Q-TWiST at 12-month follow-up showed that the gain in Q-TWiST increased from 0.9 to 1.7 months as U(TOX) and U(REL) increased from 0.0 to 1.0 (Figure 1A). The Q-TWiST gains were all statistically significant across all combinations of utilities in TOX and REL while TWiST was set at 1.0, except when utility of REL was close to 1.0 and utility of AE was close to 0.0. Similarly, the relative improvement ranged from 16.9 to 30.8% (Figure 1B).

Figure 1
figure 1

Utility threshold plot for Q-TWiST (nal-IRI+5-FU/LV vs 5-FU/LV) at 12 months in intent to treat cohort. (A) The x-axis represents the utility for time after disease progression (REL) and y-axis represents the utility for TOX time. Both U(TOX) and U(REL) vary from 0 to 1, with U(TWiST) held at 1. The diagonal bands of different colours represent varying Q-TWiST gains by level of utility for TOX and REL. To understand the Q-TWiST gain associated with a given combination of U(REL) and U(TOX), one must select the corresponding values of U(REL) and U(TOX) on the x-axis and y-axis, respectively. The intersection of these two values inside the plot indicates which band of Q-TWiST gain the result from this combination belong. (B) The x-axis represents the utility for time after disease progression (REL) and y-axis represents the utility for TOX time. Both U(TOX) and U(REL) vary from 0 to 1, with U(TWiST) held at 1. The diagonal bands of different colour represent varying relative Q-TWiST gains by level of utility for TOX and REL, calculated by dividing the absolute Q-TWiST gain by the mean survival in the 5-FU/LV group. To understand relative Q-TWiST gain associated with a given combination of U(REL) and U(TOX), one must select the corresponding values of U(REL) and U(TOX) on the x-axis and y-axis, respectively. The intersection of these two values inside the plot indicates which band of relative Q-TWiST gain the result from this combination belong. *Not significant when utility of relapse is close to 1 and utility of AE is close to 0. REL=relapse; TOX=toxicity.

In a sensitivity analysis applying literature-based mPAC utilities to each health state (U(TOX)=0.65, U(REL)=0.75, and U(TWiST)=0.80), the Q-TWiST gain in nal-IRI+5-FU/LV over 5-FU/LV remained statistically significant at 1.1 months (95% CI, 0.3–1.9 months), with a relative gain of 20%. Another sensitivity analysis was performed by removing neutropenia from the TOX time calculation, and the result remained similar to the main analysis (difference in Q-TWiST: 1.3 months (95% CI, 0.4–2.2 months)). The Q-TWiST at various time points during follow-up consistently showed significant improvement in the nal-IRI+5-FU/LV group over the 5-FU/LV group, except at the first 3 months (Figure 2).

Figure 2
figure 2

Differences in Q-TWiST between nal-IRI+5-FU/LV vs 5-FU/LV at 3, 6, 9, 12 and 15 Months of Follow-Up.

In another sensitivity analysis including grade 2 toxicities, it was found that patients treated with nal-IRI+5-FU/LV experienced longer quality-adjusted survival time compared to 5-FU/LV (difference in Q-TWiST: 0.9 month (95% CI: 0.1, 1.7)), which corresponds to a 17.5% improvement. This sensitivity analysis result remained consistent with the main analysis, which included grade 3 toxicities.

In subgroup analyses, the results across all pre-specified subgroups were in favour of nal-IRI+5-FU/LV over 5-FU/LV alone (Figure 3). These positive differences were significant for patients of Asian ethnicity, patients with Karnofsky performance status scores <90, <1.3 months since prior treatment, stage IV cancer at diagnosis, and tumour location other than the head of pancreas.

Figure 3
figure 3

Quality-adjusted time without symptoms of disease progression or toxicity differences in months (5-FU/LV vs nal-IRI+5-FU/LV) among subgroups. KPS=Karnofsky performance status; tx=treatment.

Discussion

This analysis is the first to use the Q-TWiST approach to assess the efficacy of nal-IRI+5-FU/LV combination treatment vs 5-FU/LV treatment alone in patients with metastatic pancreatic cancer. Within the NAPOLI-1 Q-TWiST analyses, nal-IRI in combination with 5-FU/LV provided significantly greater quality-adjusted survival time compared to 5-FU/LV alone in patients previously treated with gemcitabine-based therapy. This result reflects both the significantly greater median survival time (6.1 vs 4.2 months; HR=0.67 (95% CI 0.49–0.92), P=0.012) and progression-free survival time (3.1 vs 1.5 months HR=0.56 (95% CI 0.41–0.75), P=0.0001) observed for nal-IRI+5-FU/LV treatment over 5-FU/LV alone within the NAPOLI-1 trial (Wang-Gillam et al, 2016). This also supports the original trial’s finding that despite patients receiving more medicine in the combination arm, quality of life (measured using the European Organization for Research and Treatment of Cancer Quality-of-Life Core Questionnaire (EORTC-QLQ-C30)) was not appreciably different between these two treatment groups (Wang-Gillam et al, 2016).

The Q-TWiST findings were consistent across the threshold analysis and various sensitivity and subgroup analyses. Most clinical trials do not collect utility data by health states, such as toxicities, as described in this study. The threshold analysis in Q-TWiST method allows TOX and post-progression utilities to vary across their possible ranges (0–1), addressing some of the uncertainty associated with the assigned utility values. Additionally, these Q-TWiST data may help inform physicians in discussing quality-of-life issues with pancreatic cancer when considering second-line chemotherapy in patients previously treated with gemcitabine-based therapy. In this study, the absolute gain and relative gain in Q-TWiST ranged from 0.9 to 1.7 months and from 17 to 31% in the threshold analysis, respectively. The results confirmed the robustness of the results given varying utilities assigned to TOX and disease symptoms or progression. Additional sensitivity analysis was performed using alternative literature utilities (i.e., prospectively collected utilities in pancreatic cancer patients): Utility for TWiST was 0.8, REL was 0.75, and TOX was 0.65. The results showed a statistically significant and 20% relative Q-TWiST gain for nal-IRI+5-FU/LV vs 5-FU/LV, which further supports the study conclusion. Five out of the nine subgroups had significant Q-TWiST gains by nal-IRI+5-FU/LV vs 5-FU/LV, including baseline Karnofsky performance status <90, time since treatment at baseline <1.3 months, stage IV cancer, Asian ethnicity, and tumour location other than head. It showed that patients with poor performance status or more advanced disease at baseline were more likely to benefit in Q-TWiST from the treatment of nal-IRI+5-FU/LV (vs 5-FU/LV).

The statistically significant Q-TWiST gains are relatively large in magnitude and may be considered clearly clinically important using the thresholds established by Revicki et al (2006), which state a clinically important improvement for Q-TWiST as 10% of OS in the control group, while 15% indicates a clearly clinically important improvement. In contrast, Solem et al (2014) found in a recent review of Q-TWiST literature in oncology that only 18% of Q-TWiST studies reported a clearly clinically important gain (i.e., 15%) in treatment over control groups in base case analyses.

A recent analysis assessing nab-paclitaxel plus gemcitabine combination treatment vs gemcitabine-only in first-line mPAC (Reni et al, 2014) found a 21% relative improvement in Q-TWiST in favour of the combination treatment over gemcitabine-only treatment. The Q-TWiST improvement observed in the current analysis for nal-IRI+5-FU/LV (vs 5-FU/LV) was 24%. The difference in outcomes may be partially attributable to the study design and treatments; while Reni et al (2014) analysed first-line treatment on treatment-naive patients, the current analyses only included patients previously treated with gemcitabine-based regimen.

The present analysis has several limitations. First, patient-derived data regarding the utility weights were not collected prospectively. To account for this, the base case utilities for TOX and REL were both set to 0.5, consistent with numerous previous studies. A utility threshold analysis was also applied with additional sensitivity analyses to assess how the Q-TWiST value would change under alternate assumption scenarios of individual patient preference. Second, the same utility was assumed regardless of the severity and type of AE (provided grade was 3). Neutropenia is one of the most common AEs observed in the clinical trial and it is often asymptomatic in nature. Therefore, additional sensitivity analysis with and without neutropenia was calculated, which revealed negligible differences in results.

Conclusion

Applying the Q-TWiST methods to the NAPOLI-1 trial data, the combination of nal-IRI with 5-FU/LV resulted in a statistically significant and clinically meaningful gain in quality-adjusted survival vs 5-FU/LV alone. The results remained robust regardless of all possible combinations of utility weights assigned to the TOX time and disease symptom/progression time, which confirmed the favourable clinical risk benefit of nal-IRI+5-FU/LV in mPAC patients previously treated with a gemcitabine-based therapy. In addition, the magnitude of Q-TWiST gains (i.e., relative gains) is relatively large when compared to published Q-TWiST analyses in other cancers, and is considered clearly clinically important based on the published thresholds.