Introduction

Prostate cancer (PCa) is the most common non-dermatologic cancer in the western countries in western countries. Epidemiological data show that its morbidity is approximately 0.214% in males and 192,000 individuals are diagnosed with PCa annually in the United States1.High-risk PCa accounts for 15% of the diagnosed cases2.The common treatment options for high-risk PCa are radical prostatectomy (RP), radiation therapy (RT), brachytherapy (BT) and androgen-deprivation therapy (ADT)3.

Although a consensus has not been reached for the definition of high-risk PCa, the D'Amico classification system is currently widely used for risk stratification in PCa. It comprises a three-point scale for recurrence and metastasis: low risk, medium risk and high risk. High-risk PCa is defined as a prostate-specific antigen (PSA) level > 20 ng/ml, a Gleason score of 8–10, or a clinical stage ≥ T2c according to the American Urological Association (AUA) guideline4,5. However, the definition in the guidelines of the European Association of Urology (EAU) and the National Comprehensive Cancer Network (NCCN) is PSA>20 ng/ml, Gleason score 8–10, or clinical stage ≥T3a6. In addition, the Radiation Therapy Oncology Group (RTOG) also defined high-risk PCa as PSA 20-100 ng/ml, Gleason score 8–10 and any clinical stage of pT or PSA<100 ng/ml, Gleason score 8–10 and clinical stage ≥T2c7.

Because high-risk PCa is prone to recurrence and metastasis after treatment, an increasing number of studies have focused on this issue. Unfortunately, there is no consensus regarding the optimal treatment choice8. In the current study, we performed a systematic review of the literature to compare the long-term survival outcomes of RP, RT, BT, ADT and watchful waiting (WW), alone or in combination, in patients with high-risk PCa.

Results

Study characteristics

A flowchart of the literature searches is shown in Figure 1. Of the 18 studies including 6986 patients, six compared different approaches without combined regimens (N = 3682) and the remaining 12 used combined regimens (N = 3304). The characteristics of the included studies are summarized in Table 1. Meta-analysis was performed only for the CSM of Zelefsky et al.16, Tewari et al.18 and Kibel et al.19 by STATA software version 12.0. Mantel-Haenszel fix effects model was used to estimate the CSM for the three studies for the I2 = 33.0% (P = 0.225). The pooled HR of CSM was (0.51 [95% CI 0.30–0.73], P<0.001) with a low heterogeneity (Figure 2).

Table 1 Characteristics of included studies (N = 18)
Figure 1
figure 1

Flowchart of literature searches.

Figure 2
figure 2

Forest plot of pooled hazard ratio (HR) for cancer-specific mortality (CSM).

All the studies were prospective or retrospective cohort studies, except for four RCTs12,13,14,15 and one case-controlled study22. The Jadad scale of each of the four RCTs was 3 points; therefore, the studies were considered to be high quality (Table 2). The NOS quality assessments of the 10 cohort studies showed satisfactory results, with star ratings of ≥7 (Table 3). The NOS score of the case-controlled study was 822.

Table 2 Quality assessments of RCTs with Jadad Score (N = 4)
Table 3 Quality assessments of cohort studies with Newcastle-Ottawa Scale (NOS) (N = 13)

Results of included studies

Studies without combined regimens (N = 6)

RP versus RT (N = 2)

Zelefsky et al. included 2380 patients with pathologically confirmed T1c-3b stage PCa, of whom 409 were high-risk16. Among the high-risk group, the 8-yr CSM was 3.8% for RP and 9.5% for RT (P = 0.015). The absolute difference (AD) between groups (using RP−RT) for 8-yr distal metastasis-free survival (DMFS) was higher in patients with high-risk tumors compared with those with intermediate-risk or low-risk tumors (high-risk, intermediate-risk and low-risk: 7.8%, 3.3% and 1.9%, respectively).

Merino et al. performed a retrospective cohort study17. Of the 1200 patients with clinically localized PCa, 294 patients were high-risk (216 in RP vs. 78 in RT). Stratified analysis revealed that RP resulted in a better outcome regarding 7-year OS (87.5% vs. 77.3%, P = 0.02); however, there was no significant difference between high-risk patients in CSS (85.4% for RT vs. 93.0% for RP, P = 0.07) and those in CSM (RT vs. RP: hazard ratio [HR] 1.71, P = 0.218).

RP versus RT versus WW (N = 1)

Tewari et al. included patients with high-risk localized PCa with Gleason 8–1018. Among these, 197 received WW, 137 received RT and 119 underwent RP. The risk of CSM in patients who underwent RP was 49% and 68% lower than that in patients treated with RT and WW, respectively (HR 0.51, P = 0.053; HR 0.32, P = 0.001, respectively). The difference between RT and WW was also significant (HR 0.64, P = 0.018).

RP versus RT versus BT (N = 2)

Kibel et al. did a prospective combined retrospective study of 10,429 patients with localized PCa19, of whom 1201 were high-risk. Of the high-risk patients, 525 underwent RP, 676 received RT and 33 received BT. Multivariate analysis revealed that the differences in OS were significant (RP vs. RT: HR 1.7, P = 0.001; RP vs. BT: HR 3.1, P < 0.001). However, there was no difference in CSM among the treatment groups (RT vs. RP: HR 1.3, P = 0.2; BT vs. RP: HR 1.6, P = 0.5). Unfortunately, the comparison of RT and BT groups was not available.

A study performed by Stokes et al. compared the long-term biochemical disease-free survival (BDFS) for patients undergoing RP, external beam radiotherapy (EBRT) and BT alone20. Of the 318 patients included, 268 were high-risk (RP vs. EBRT vs. BT: N = 134 vs. 95 vs. 39, respectively). There was no significant difference between BT and RT for 5-yr BDFS, although the former had a higher result. A significant improvement was observed with RP (RP vs. RT, RP vs. BT and RT vs. BT: P < 0.0001, P = 0.0136 and P = 0.1928, respectively).

RP versus RT versus ADT (N = 1)

The CaPSURE study was a prospective and retrospective cohort study, with a large sample size of localized PCa patients (N = 8982)21. Of the 7538 patients whose data were available, 5066 underwent RP, 1143 received RT and 1329 received ADT. The study made systemic predictions of the likelihood of OS, PFS and pathological stage according to the CAPRA (California San Francisco Cancer of the Prostate Risk Assessment) scoring systems (Table 1). After stratification using the CAPRA, the differences in HR (using RT−RP, ADT−RT, or ADT−RP) for 10-yr CSM increased with higher CAPRA scores (Table 4). Data revealed that the 10-yr CSM of RP and RT were similar in moderate-risk and low-risk patients. However, the 10-yr CSM in high-risk patients treated using RP was significantly lower than that in high-risk patients treated using RT.

Table 4 Results of high-risk group/subgroup of included studies (N = 18)

Studies with combined regimens (N = 12)

RP versus RP + ADT (N = 2)

Siddiqui et al. performed a case-controlled study, which enrolled 191 pT3bN0M0 patients who underwent adjuvant ADT (aADT) and then matched them (1:1) with a control group receiving RP alone22. Finallly, the RP + aADT group experienced improved 10-yr CSS (94% vs. 87%, P = 0.037). However, the 10-yr OS was similar between groups (75% vs. 69%, P = 0.12). A cohort study by Bastide et al. also compared the use of RP (N = 82) with RP plus aADT (N = 41) for pT3bN0M0 patients23. Multivariate analysis showed that the combined group had a lower PSA-biochemical recurrence rate (HR 0.64, P = 0.13).

RP + ADT versus RT + ADT (N = 1)

Koie et al. reviewed 329 high-risk localized PCa patients that were treated with RP plus nADT (N = 216) or RT plus nADT (N = 81)24. Propensity-score matching identified 78 matched pairs of patients with similar baseline data. Kaplan-Meier analysis showed that 3-yr OS was 98.3% and 92.1% in RP and RT groups, respectively (P = 0.156). There were also no significant differences in the 3-yr biochemical recurrence-free survival rates (BRFS) (RP vs. RT: 86.4% vs. 89.4%, P = 0.878).

RP versus RT+ADT (N = 2)

A recent study by Lee et al. included patients with high-risk localized PCa that was treated with RP (N = 251) or RT plus neoadjuvant ADT (nADT) (N = 125)25. Results showed that the 5-yr CSS was longer in patients treated with RP compared with the combined group (96.5% vs. 88.3%). Multivariate analysis showed that CSM increased significantly in the combined group than RP group (HR 3.22, P = 0.001). A similar study by Hsu et al. compared patients with T3a PCa, treated with RP (n = 200) or RT plus nADT (N = 35)26. The long-term survival outcomes were better for RP, although OS (95.9% vs. 79.8%, P = 0.21) and CSS (98.7% vs. 88.7%, P = 0.42) differences were not significant.

RP versus RT + BT + ADT (N = 1)

Westover et al. compared treatment using RP with EBRT + BT + (neoadjuvant + concurrent) ADT in patients with localized PCa with a Gleason score of 8–1027. Multivariate analysis showed that RP was not associated with an increased risk of CSM compared with the combination group (HR 1.8 [0.6–5.5], P = 0.3).

RT versus RT + ADT (N = 4)

A RCT conducted by Bolla et al. reported that RT plus 3-yr aADT resulted in a significantly better 5-yr OS than RT alone (79% for the combination vs. 62% for RT, P = 0.001)12. D'Amico et al. also performed a comparison between RT and RT plus 6-mo aADT13. Significant difference was also found for 5-yr OS (88% vs. 78%, P = 0.04). Pilepich et al. also reported a better CSS using RT plus aADT (63.5% vs. 48.2% P = 0.01) than RT alone14. Similarly, Miljenko, et al. revealed a better outcome using RT plus (n + c) ADT, although the difference was not significant (8-yr OS 38% vs. 31%, P = 0.98; 8-yr CSM 44% vs. 54%, P = 0.36)15.

RT + BT versus RT + BT + ADT (N = 2)

Galalae et al. included 611 patients with localized PCa28. Among these, 359 were at high-risk and were treated with RT and high-dose rate BT (HDR-BT) either combined with (N = 119) or without (N = 240) (n + c) ADT. Multiple regression analyses showed that the “no ADT” group had a better outcome regarding OS (87% vs. 80%, P = 0.057) and CSS (97% vs. 90%, P = 0.002). Demanes et al. performed a similar study of 113 patients with high-risk PCa29, with 65 and 48 in the “ADT” group (RT + BT + ADT) and “no ADT” group, respectively. Kaplan-Meier analysis demonstrated improved 10-yr PSA-progression-free survival (PSA-PFS) in patients in “ADT” group, but not significantly (70% vs. 62%, P > 0.05).

The results of each study are shown in Tables 4.

Discussion

Recent treatment options for high-risk PCa include RP, RT, BT, ADT, WW and combined schemes30. However, the optimal first-line treatment for clinically high-risk PCa remains controversial31. Current guidelines are inconsistent between the EAU, AUA and NCCN32,33,34. EAU and AUA preferred to choose RP as the first step for patients with high-risk PCa; EAU also suggested that ADT should be given to patients after RP. Conversely, the initial option recommended by NCCN was combined RT with ADT. Therefore, the current study focused on a comparison of all available approaches.

To our knowledge, this is the first systematic review comparing the long-term survival of patients with high-risk PCa treated with all available approaches (RP, RT, BT, ADT and WW, combined or alone). One of the most important outcomes of our systematic review of 18 longitudinal studies is that RP/(RT plus aADT) has the best survival outcome in patients with high-risk PCa. Conversely, WW had the worst outcome. The overall priority for treatment strategy could be ranked as follows: RP/(RT plus aADT), RT and ADT/WW. RP has significant better OS and BDFS than RT or BT. RP can significantly decrease 49% of the CSM than RT alone based on the meta-analysis results. Although ADT can improve the CSS of RP, we do not recommend the regimens of (RP+aADT) because this conclusion is based on a case-control study22; when comparing RP with (RT+ADT), added ADT to RT still fail to challenge the position of RP for the latter has significant lower CSM; ADT can improve the OS and CSS of RT significantly, but the outcomes are inconsistent when ADT is added to regimens of (RT+ BT). In brief, RP/(RT + aADT) is the appropriate first-line therapy regimen for high-risk PCa. For patients who can tolerate surgery, RP is the preferred choice; RP plus aADT regimen can be used for clinical T3b patients. For patients who are vulnerable to comorbidities, such as the elderly, the alternative is (RT+aADT); the conclusions of (RT + BT) with/without ADT are not inconsistent.

It seemed that consensus had been reached on this issue after the current systematic review that RP had better survival outcomes, but it did not mean that RP possessed the overwhelming advantage than other approaches, especially than RT plus aADT. In fact, the treatment option was affected by many different factors and thus it was unwise of urologists to select RP for all patients with high-risk PCa. For example, a 20-year follow-up study by Peter et al. analyzed 767 patients with localized PCa that were treated using observation or ADT35. Of the 138 deaths with a Gleason score of 8–10, 53% were due to PCa and 24% due to other causes (such as comorbidities including diabetes mellitus or hypertension). In addition, a cohort of localized PCa patients assessed by Lu-Yao et al. had a median age of 78 yr36 and was managed conservatively without surgery or radiation. After a median 8.3-yr follow-up, the 10-yr death rate for poorly differentiated subgroup was 25.6% due to PCa and 56.5% due to other causes. These two studies revealed that a competing medical condition could affect survival outcomes, particularly in elder patients who were vulnerable to comorbidities. In contrast, the possibility of complications, such as incontinence and erectile dysfunction, should be taken into account in younger individuals who are candidates for RP. Ward et al. reported a 15-yr study assessing RP use in patients presenting locally advanced (cT3) PCa37. Among these, 21% suffered incontinence after 1 yr and 75% had no erectile function after RP (only 26% with bilateral or unilateral nerve preservation). Therefore, although we had drew conclusion that RP had better survival outcomes, different treatment options should be considered according to the tumors, patient age, concomitant diseases and individual preferences.

In studies that included combined regimens, ADT was used widely to improve the survival outcomes of RP, RT, or BT. ADT use before surgery or radiotherapy was based on “the first hypothesis” that androgen ablation might reduce the tumor bulk and enhance tumor cell kill to enable total excision of the cancerous area, whereas ADT use after surgery or radiotherapy might eliminate residual tumor cells in the primary lesions and subclinical metastatic lesions (called “the second hypothesis”). Studies by Siddiqui et al.22 and Bastide et al.23 demonstrated that nADT before RP could improve long-term survival outcomes. However, both these studies included only high-risk patients with seminal vesicle invasion (pT3b); therefore, we cannot reach the same conclusion for localized PCa patients with clinical stage ≥ T2c, Gleason 8–10, or PSA>20 ng/ml. In addition, four RCTs with the addition of aADT to RT12,13,14,15 substantially verified “the second hypothesis”. But for “the first hypothesis”, it was not safe to draw a positive conclusion because the inconsistency of Galalae et al. and Demanes et al.28,29. Specifically, better outcome in the “no ADT” group in the study by Galalae et al. was unexpected28. One explanation for this is the obvious selection bias in the “ADT” group because ADT therapy was performed specially in patients with an enlarged prostate. An alternative explanation is that patients did not benefit from the short duration of ADT. Therefore, additional high-quality RCTs are needed to establish the value of nADT before RT plus BT for high-risk PCa.

Although the 14 large-scale longitudinal studies included in this systematic review provided evidence to allow robust conclusions to be drawn, the review has several limitations. First, many studies had limited methodological quality. The definition of high-risk PCa, RT doses and cycles, start time of follow-up and median follow-up durations varied among studies. This also made it no value to conduct a formal meta-analysis for most of the included studies and the available meta-analysis was conducted only based on three studies. Second, RCTs that compared the long-term survival of RP and other approaches directly in patients with high-risk PCa are still unavailable; most of the available studies included patients with localized PCa; therefore, high-risk patients were discussed as a subgroup. Much baseline data (e.g. mean age and mean PSA) were not reported, making comparisons among studies challenging. Finally, important issues such as perioperative complications and cost effectiveness of RP and RT were not assessed. Treatment costs should be considered, particularly in developing countries such as China. Therefore, further large scale, rigorous RCTs with consistent inclusion criteria, design and outcome measures are strongly desirable to ascertain the long-term outcomes, safety and cost-effectiveness of the different treatment approaches.

In summary, this systematic review provided strong evidence to support RP or RT plus adjuvant ADT as first-line therapy option for high-risk PCa. Although RP resulted in the best survival outcomes, it was not suitable for all patients. In patients who could tolerate surgery, RP is the best choice, whereas the alternative is RT plus aADT in patients who are vulnerable to comorbidities. Urologists should fully discuss all treatment options with the patient and specialists from other related disciplines and comprehensively consider various factors including the tumors and patient preferences. This would allow treatment benefits to be expanded to their fullest potential in all patients.

Methods

Search strategy

We searched for relevant studies (search date, July 12, 2014) using OvidSP to search three databases: Ovid MEDLINE® (1946 to present), EMBASE® (1974 to July 19, 2014) and the Cochrane Central Register of Controlled Trials® (June 2014). The search strategy was as follows: [(Prostatic Neoplasms or prostate cancer).sh. or (Prostate Neoplasm or carcinoma of prostate).tw.] and [(high-risk or High-grade).tw.] and [(prostatectomy or Radiation or Radiotherapy or Brachytherapy or watchful waiting or observation).sh. or (radical prostatectomy or radiation therapy or androgen-deprivation therapy or seed implantation or active surveillance).tw.]. The meaning of “sh” and “tw” were MeSH heading and text word, respectively. The reference lists of the included studies, other reviews and related articles not identified by our electronic searches were also screened for additional possible studies. Our literatures search had no language and publication status restrictions. The gender was limited to male. Two authors (Lin and Cao) then reviewed the titles, abstracts and the full text of each article, independently. Any disagreements were solved by discussion within the study group.

Study selection

Studies that met all of the following criteria were included: longitudinal studies a) in which the study population or subpopulation included high-risk PCa patients, b) using RP, RT, BT, ADT, or WW as the study variables or exposure variables and c) that reported quantitative end-points comparing the effect between or among RP, RT, BT, ADT, or WW [e.g., OS, CSS and cancer-specific mortality (CSM)] with at least three years' median follow-up. High-risk PCa was defined as clinical stage ≥T2c, Gleason score 8–10, or PSA > 20 ng/ml, with a negative computerized tomography or bone scan.

Data extraction

Data from the included studies were extracted by two reviewers (J.H. and D.H.) and cross-checked, respectively. Any disagreements were reconciled by a third person (L.R. or Q.W.). The following information was collected from the reports of original trials: first author, study design, sites, time, population, median follow-up, comparison of treatment, end-points and definition of high-risk PCa. Discrepancies were resolved in consulation with Q.W.

Quality assessment

The methodological quality of the included studies was evaluated according to the Jadad scale for randomized controlled trials (RCTs)9 and the Newcastle-Ottawa Scale (NOS) using a “star system” for cohort or case-controlled studies10,11. Scores ≥3 points and ≥7 points were considered high quality using the Jadad scale and NOS, respectively.

Statistical analysis

The log hazard ratio (HR) was chosen as the appropriate summary statistics because it was the only summary statistic that allows for both censoring and time to an event. Possible heterogeneity of studies was quantified using the chi-squared test and I2 value. If I2 < 50% or P value > 0.10 (considerable lower heterogeneity), the fixed effect model was chosen; otherwise the random effect model was used. An observed HR > 1 indicated a worse outcome for the positive group compared to the negative group and was considered significant if the 95% CI did not overlap 1. We followed the PRISMA statement for conducting a high-quality meta-analysis. All analyses were performed using STATA software version 12.0 (STATA Corporation, College Station, USA) analysis.