Background

Treatments recommended for cancer localised within the prostate gland include radical surgery, radiotherapy, and active monitoring/surveillance, where radical treatment is avoided or delayed unless/until the cancer shows signs of progression. The ProtecT randomised treatment trial showed there was no evidence of a difference in prostate cancer mortality at a median of 10 years’ follow-up between 3D-conformal radiotherapy with neo-adjuvant androgen deprivation therapy (ADT), radical surgery, and active monitoring in men with clinically localised prostate cancer.1 Men randomised to surgery and radiotherapy had half the rate of prostate cancer progression and metastasis compared with the active monitoring group, but they experienced greater levels of treatment side effects, including urinary incontinence, erectile dysfunction and bowel symptoms.2

In most high- and middle-income countries, healthcare priorities are considered in the context of treatment effectiveness and its cost. Of two trials that compared surgery with watchful waiting,3,4 only the Scandinavian Prostate Cancer Group-4 trial (SPCG-4), conducted in the pre-PSA (Prostate Specific Antigen) era, evaluated costs. SPCG-4 found costs 34% higher for surgery compared with watchful waiting at a median of 12 years’ follow-up. The higher costs for surgery related to the initial treatment costs.3 In the absence of more contemporary trial data, models have been developed to estimate cost-effectiveness of treatments but have produced conflicting results.5,6,7,8,9

This paper presents an individual patient data economic evaluation of the ProtecT trial in terms of costs to the UK NHS and Quality-Adjusted-Life-Years (QALYs) at a median of 10 years’ follow-up, the prespecified time point for the primary analysis.1

Methods

Study design and participants

The ProtecT trial’s design and protocol have been published elsewhere.1,10 In brief, men in nine UK centres who were detected and diagnosed with clinically localised prostate cancer following a programme of population-based PSA testing who met trial eligibility criteria were randomised to active monitoring (n = 545), surgery (n = 553), or radiotherapy with neo-adjuvant ADT (n = 545).

Resource-use data collection, coding, and valuation

A UK NHS perspective was used for the study; hence, health service resource-use data related to prostate cancer and treatments were recorded from randomisation until November 2015 (a median of 10 years’ follow-up). Resource-use relating to initial treatments and subsequent interventions were recorded onto study-specific proformas. Follow-up inpatient stays, outpatient, Emergency Department and, from April 2005, primary care attendances were recorded onto schedules at annual research nurse appointments with participants and from a hospital medical records review. To ensure data completeness, treatment proformas and annual schedules were compared, duplicate data dropped, and combined sources of data used. The trial database containing some clinical information was used for validation purposes. Supplementary Table 1 outlines how the resources were measured, coded and valued (2014–2015 UK £ prices) and Supplementary Table 2 outlines how missing resource-use and EQ-5D-3L (QALY) data were handled.

Outcome measurement

The outcome for the economic analysis is the QALY at a median of 10 years’ follow-up. As recommended by NICE, utility values were estimated from the Euroqol EQ-5D-3L questionnaire and associated societal UK utility tariffs.11 The EQ-5D-3L was participant-completed prior to diagnosis (baseline), at 6 months, and annually from each man’s randomisation date. Men who died were assigned a zero-utility value for the remaining years they could have been trial participants. Utility values were combined to estimate the number of QALYs for each participant using the area-under-the-curve approach.11

Analysis

The analysis compared the three groups as randomised, considering prostate cancer-related NHS costs in relation to QALYs for a median of 10 years. Costs and outcomes for the 10-year median analysis were discounted at 3.5%.12 Analyses were conducted in Stata 14.1.13 Each item of resource used was summed for each man and the mean resource-use calculated by category (e.g. inpatient stays for infection) and trial group, over the median 10-year period. Each item’s cost was calculated as the use (e.g. number of GP visits) multiplied by its unit cost and were summed annually, across time and by resource-use category for each participant.

Annual adjusted mean costs and QALYs were estimated using linear regression. The method of ‘seemingly unrelated regressions’ (SUR), which accounts for the correlation between costs and QALYs, was used to estimate total adjusted mean costs and QALYs.14 Costs and QALYs were adjusted for study centre, age, Gleason score (<7, 7, 8–10) and PSA at baseline, in keeping with the primary outcome analysis. QALYs were also adjusted for baseline utility.11 Across the three treatment groups, the adjusted mean costs and QALYs were compared to assess if any of the treatments were less effective and more expensive than the other treatments. If that was the case, then incremental cost-effectiveness ratios (ICERs) would not be estimated in relation to that treatment.15 Incremental adjusted mean costs and QALYs, bias-corrected and accelerated confidence intervals (to account for non-normal distributions), and ICERs were estimated using SUR and non-parametric bootstrapping (5000 model iterations). Regression outputs were used to estimate parametrically the incremental net monetary benefit (iNMB) statistic and associated confidence intervals at the UK NICE willingness-to-pay threshold of £20,000 per QALY.12

Cost-effectiveness acceptability curves (CEACs) were generated to explore sampling uncertainty in the cost-effectiveness estimates,16 presenting the probability that each trial group is the cost-effective option compared to the other two groups at a range of monetary values. To calculate the CEACs, individual net monetary benefit values were calculated at each willingness to pay per QALY threshold (£0–£100,000 at £1000 intervals). At each threshold, 5000 bootstrap model iterations of the adjusted linear regression models of the net monetary benefits were performed. The proportion of times (that is the probability) that each treatment group had the highest net monetary benefit at each threshold was then calculated and plotted to create the CEAC. One-way and scenario sensitivity analyses were used to account for methodological uncertainty or assumptions made during the study and analysis (see Supplementary Information). Exploratory subgroup analyses explored heterogeneity within the study population for age (<65 vs. 65+ years at randomisation); Grade group (1 vs. 2 and higher; and D’Amico risk classification (low vs. intermediate and high).17 These were chosen to reflect more relevant contemporary classifications of the prespecified subgroups in the ProtecT primary analysis.1

Results

The adjusted cost and QALY SUR analysis were based on 1101 (67%) of the 1643 men randomised into the ProtecT trial. More data were available for other analyses (see Table 1, Supplementary Tables 34 for sample sizes).

Table 1 Total unadjusted mean resource-use and cost by allocation arm.

Resource-use

Resources related to primary treatments varied by randomised groups (Table 1), with the radiotherapy group having more outpatient visits, colonoscopy and sigmoidoscopy procedures, and fewer inpatient stays. There were more primary care resources, biopsies, magnetic resonance imaging (MRI) scans, bone scans, and transurethral resections of the prostate (TURP) in the active monitoring group, and more infection and urinary sphincter-related inpatient stays in the surgery group.

Basecase results

Over the median 10 years’ follow-up, the total adjusted mean costs of the surgery (£7519: 95% CI £7099–£7940) and radiotherapy (£7361: 95% CI £6938–£7783) groups remained very similar. Both radical groups were more expensive than the active monitoring group (£5913: 95% CI £5494–£6332). Year 1 adjusted mean costs were also similar for surgery (£4898) and radiotherapy (£4708), but much lower for active monitoring (£1166 (Supplementary Table 3; Fig. 1)).

Fig. 1
figure 1

Line charts displaying for each annual follow-up time point, by allocation arm (Active monitoring, Radiotherapy, Radical prostatectomy) the (a) mean adjusted annual costs and (b) mean adjusted annual QALYs (see Supplementary Tables 3 and 4 for sample sizes).

The total adjusted mean QALYs were similar for all groups—just slightly higher for those allocated to radiotherapy (7.093 QALYs: 95% CI 6.914–7.273) compared with active monitoring (6.976 QALYs: 95% CI 6.798–7.154) and surgery (6.909 QALYs: 95% CI 6.731–7.087) (Table 2).

Table 2 Cost-effectiveness results.

The result that the radiotherapy group was more expensive than the active monitoring group but had slightly higher QALYs meant that it was appropriate to estimate an ICER. The ICER of £12,310 per QALY showed that radiotherapy was the cost-effective option at the standard UK NICE willingness-to-pay threshold of £20,000 per QALY. The CEAC (Fig. 2), reflecting sample uncertainty, shows at this threshold the probability that radiotherapy is the cost-effective option is 58%, with a probability of 32% for active monitoring and 10% for surgery.

Fig. 2
figure 2

Cost-effectiveness acceptability curve displaying the probability of each treatment (Active monitoring, Radiotherapy, Radical prostatectomy) being the cost-effective option at different willingness-to-pay thresholds.

Sensitivity analyses

Sensitivity analyses (Supplementary Table 5) confirmed these results for the most part, except when QALY data were not imputed, or when only including participants following the introduction of primary care data collection, or excluding men recruited during the feasibility period; and the scenario using newer techniques (robotic surgery and intensity-modulated radiation therapy (IMRT)). In these four analyses, radiotherapy would be cost-effective at the higher UK-NICE willingness-to-pay threshold of £30,000 per QALY.

Subgroup analyses

Subgroup analyses (Table 3) showed that QALYs were again similar between groups, but costs varied. For example, for younger men and those with lower risk disease (D’Amico classification), active monitoring was less costly and had higher QALYs than the radical groups and so was most likely to be the cost-effective option. However, for older men and those with higher risk disease, radiotherapy was most likely to be the cost-effective option. At the £20,000 per QALY threshold, the probability that active monitoring was the cost-effective option was 80% for younger men, 84% for D’Amico low-risk disease, and 66% for Grade group 1; and that radiotherapy was the cost-effective option was 73% for older men, 69% for D’Amico intermediate/high-risk disease, and 75% for Grade group 2 or higher.

Table 3 Subgroup cost-effectiveness results.

Discussion

In this cost-effectiveness analysis of the ProtecT treatment trial, there were remarkably small differences between the treatment groups at a median of 10 years. Overall, radiotherapy had the greatest probability of being the cost-effective option for localised prostate cancer at a median of 10 years follow-up at the UK NICE willingness-to-pay threshold of £20,000 per QALY and was most likely to be the cost-effective option at all higher thresholds.

The marginally higher QALYs in the radiotherapy group meant that although it was more costly than the active monitoring group, it was more likely to be the cost-effective option. While the subgroup analyses for both the higher risk, and 65 years and older groups mirrored these results with greater certainty, for younger men and/or those with low-risk disease (defined by the D’Amico classification) active monitoring was more likely to be the cost-effective treatment group.

Over the median 10-year period, the mean cost-difference between the radiotherapy and surgery groups was only £159. The higher surgery group cost was robust to all but the sensitivity analysis that costed robotic surgery and intensity-modulated radiotherapy, and in the subgroup analyses it was lower for younger men and the low-risk groups (range: £127–£244). Throughout all the analyses the surgery group had fewer QALYs than the radiotherapy group; the difference in QALYs would translate into a reduction of 67 days in the best imaginable health over a median of 10 years.

The finding that the active monitoring group had the lowest costs may address clinician concern that active surveillance could be more expensive in the long term because of the need to keep monitoring patients who might ultimately end up having treatment. This study shows that at a median 10 years this was not the case.

The uncertainty reflected in only a 58% probability that the radiotherapy group was the cost-effective option needs to be acknowledged; the result indicates that it is inconclusive as to which treatment over the median 10 years would be the best value for money. For older men and/or those with higher risk disease, there was more certainty that the radiotherapy group was the cost-effective option; similarly, there was more certainty that the active monitoring group was the cost-effective option for younger men and/or those with low-risk disease—although these findings, being from subgroup analyses were conducted on smaller numbers and therefore need to be interpreted with some caution.

This study is the first and only economic evaluation within a randomised trial comparing the three contemporary major treatment modalities (ProtecT). The only previous randomised evidence came from SPCG-4, comparing surgery with watchful waiting—not active monitoring, and in the pre-PSA era. This showed higher costs in the surgery arm.3 The findings from this study are discordant with some modelling studies. To facilitate comparisons, published costs from the modelling studies were inflated to 2015 prices18 and converted to relevant currencies using 2015 purchasing power parities.19 Two lifetime modelling studies based on 65-year-old men indicated that active surveillance was the most cost-effective strategy compared with radical prostatectomy (7.6 QALYs: €9585 (£9260 vs. 7.56 QALYs: €16,468 (£15,911)8 and 8.85 QALYs: US$39,894 (£28,796) vs. 7.95 QALYs: US$38,180 (£27,559)6) and IMRT (8.10 QALYs: US$48,699 (£35,152)6]. In two models using SPCG-4 data, surgery was shown to be cost-effective, although it was more expensive than watchful waiting for 65-year-old men (ICER: 58,045 SEK (£4957) per QALY)20 and when an active surveillance programme was considered (ICER: NZ$33,160 (£16,070)).9 The limited availability of outcome data has been acknowledged to have been a weakness of the models produced.6,8

The main strength of this study is that it is based on individual patient data from the ProtecT randomised trial comparing the three contemporary major treatment modalities over a 10-year median duration. The use of medical records in conjunction with a participant visit meant that hospital-based missing data were minimised and likely to be missing completely at random. The presentation of resources used and their costs, and the use of different willingness-to-pay per QALY thresholds, means that country-specific costs and thresholds can be applied to these data.

Limitations of the study include changes that have occurred during the study’s duration. New techniques such as robot-assisted surgery and IMRT have become more prevalent and are potentially more expensive; in addition, there has been a global shift to shorter treatment courses of radiotherapy. A sensitivity analysis in which surgery was costed as robot-assisted, and radiotherapy as intensity-modulated radiotherapy meant that the radiotherapy group was only cost-effective at the higher £30,000 per QALY threshold; however, reducing the number of fractions delivered led to the radiotherapy arm becoming more cost-effective. Active surveillance programmes have also changed over time. Improved diagnostic techniques (mpMRI) might enable better selection of patients, which could lead to fewer changing to radical treatments or developing metastases, potentially lowering costs and improving effectiveness.

Another important limitation relates to the small number of events in the ProtecT trial to date because of the long natural history of prostate cancer, meaning that longer follow-up is needed to establish whether there are differences in mortality between the groups—which could change the balance of cost-effectiveness. In the late 1990s when the ProtecT trial was being designed, a 10-year time horizon was expected to show differences in clinical events. The exploratory subgroup analyses suggested differences in costs and effectiveness between age and prostate cancer risk groups. It would thus be premature to imply ruling out any of the treatment options at the present time, particularly following the 23-year follow-up of the SPCG-4 trial demonstrating improved survival in men receiving surgery compared with watchful waiting.21 Longer-term follow-up is needed in ProtecT to ascertain survival differences.

Other limitations relate to data collection and analysis. Primary care resource-use was not recorded in the follow-up schedules at the beginning of the trial. Therefore, all neo-adjuvant ADT treatment and PSA tests recorded in the trial database were costed. A sensitivity analysis including only participants with all primary care visits recorded indicated that the radiotherapy group was only cost-effective at the higher £30,000 per QALY threshold. Missing data occurring within the follow-up proformae were imputed following discussions with clinical staff, and a researcher blinded to treatment allocation removed duplicated events between sources, which could have led to small errors. Incomplete EQ-5D-3L data meant QALYs could only be calculated for 69% of the sample and could have been missing not at random, for example due to ill health. There is a suggestion that participants missing from the surgery group may have been less healthy at baseline than those missing in the other two arms (Supplementary Table 6), potentially leading to an overestimate in the mean QALY reported for the surgery group, but not changing the overall findings.

Conclusion

In the primary economic analysis at a median of 10 years, radiotherapy was the most likely cost-effective option because of slightly lower initial costs and slightly more QALYs, but this result was not conclusive. In subgroup analyses, there was more certainty that radiotherapy was the cost-effective option for older men and/or those with intermediate/high-risk disease. Active monitoring was the least costly option overall, and subgroup analyses suggested it was the most likely cost-effective option for younger men and/or those with low-risk disease. This evaluation provided evidence of the cost-effectiveness of the major primary treatment modalities for clinically localised prostate cancer over a median 10-year period. Further follow-up and subsequent modelling are required to compare other types of treatments, including different radiotherapy modalities, and to assess the impact of later stages of progressing disease to establish which treatment might be most cost-effective over a man’s lifetime. In the meantime, all treatments should continue to be offered, with patients, clinicians, and policy-makers using these results in combination with the evidence from ProtecT in relation to the trade-offs between disease progression, metastases, and urinary, sexual and bowel function.