Main

Endometrial cancer (EC) is a major cause of mortality for patients worldwide. Although its incidence differs throughout the world, it is estimated to be the most common cancer of the female genital tract and the fourth most common cancer in North America and Europe (Jemal et al, 2010; Colombo et al, 2013).

Early-stage EC restricted to the uterus represents nearly 80% of all cases (Creasman et al, 1987, 2006; Colombo et al, 2013). The estimated 5-year overall survival for these patients is 95% but decreases substantially to 67.0% and 15.9% for local and distant disease, respectively (Creutzberg et al, 2000a; Randall et al, 2006; Benedetti Panici et al, 2008; ASTEC study group et al, 2009). Moreover, the recurrence rate for early-stage EC is widely variable ranging from 2 to 26% (Creutzberg et al, 2000a; Benedetti Panici et al, 2008; ASTEC study group et al, 2009; Nout et al, 2010; Todo et al, 2010; Nugent et al, 2012). In this specific setting, many epidemiological and histological factors such as increasing age, depth of myometrial invasion, histological tumour type and grade, presence of lymphovascular space invasion (LVSI) and the International Federation of Gynecology and Obstetrics (FIGO) classification (Pecorelli, 2009) have been reported to be correlated with a higher risk of recurrence and nodal metastases (Creasman et al, 1987; Mariani et al, 2002; Keys et al, 2004; Nout et al, 2010; Todo et al, 2010; Nugent et al, 2012; Colombo et al, 2013).

Over the last decade, these criteria have been aggregated into several risk stratification systems (RSS) that are currently used worldwide to guide decision-making and clinical trial design (Creutzberg et al, 2000a; Keys et al, 2004; Todo et al, 2010; Colombo et al, 2013; Bendifallah et al, 2014). The assumption is based on defining recurrence risk groups, which can help identify clinical situations where multimodality therapy and/or nodal staging should be proposed for high-risk patients or, conversely, single modality or wait-and-see strategies for low-risk patients. Although the core variables of these RSS are very similar (Creutzberg et al, 2000a; Keys et al, 2004; Todo et al, 2010; Colombo et al, 2013; Bendifallah et al, 2014), finally, it appears that for major RSS: (i) most have never been externally validated; (ii) accuracy is not reported and (iii) no simultaneous comparisons using the same cohort have been performed.

Hence, the aim of this study was to compare five major RSS (Creutzberg et al, 2000a; Keys et al, 2004; Todo et al, 2010; Colombo et al, 2013; Bendifallah et al, 2014) in a multicenter cohort of patients with early-stage EC with regard to their discriminative performance in stratifying the risk of recurrence and nodal metastases.

Materials and Methods

Study population

The data of 553 patients with apparent early-stage EC, who received primary surgical treatment between January 2001 and December 2012, were abstracted from five institutions with maintained EC databases in France (Tenon University Hospital, Reims University Hospital, Dijon Cancer Center, Creteil hospital and Jeanne de Flandre University Hospital) and from the Senti-Endo trial (Ballester et al, 2011). All patients had undergone a preoperative endometrial biopsy. All enrolled patients underwent a preoperative MRI unless contraindicated, in which case a CT scan was performed. Patients with histologically proven EC were staged on the basis of final pathological findings according to the 2009 FIGO classification (Pecorelli, 2009). Clinical and pathologic variables included patient age, surgical procedure, 2009 FIGO stage and final pathological analysis (histological type and grade, depth of myometrial invasion and LVSI status). A tumour was considered LVSI-positive when tumour emboli were found within a space clearly lined by endothelial cells (Tsuruchi et al, 1995). The research protocol was approved by the institutional review board of the French college of obstetricians and gynecologists (CEROG 2014-GYN-020).

Treatment and follow-up

We included all women who underwent primary surgical treatment including at least total hysterectomy with bilateral salpingo-oophorectomy, with or without nodal staging (pelvic ± paraaortic lymphadenectomy) according to the current guidelines (Querleu et al, 2011; Colombo et al, 2013) and to the surgeon’s discretion. Sentinel lymph node biopsies (SLNB) were performed by a dual intracervical injection based on the histological validation of SLN by Delpech et al (2007). A para-aortic lymphadenectomy was recommended for women with metastatic pelvic SLN on intraoperative histology or after final histology. Systematic pelvic and para-aortic lymphadenectomy was also recommended for patients with type 2 EC (clear-cell, serous EC and carcinosarcoma) and type 1, grade 3 with a depth of myometrial invasion >50%. Adjuvant therapy was administered on an individual basis at the discretion of a multidisciplinary committee according to international guidelines (Colombo et al, 2013) and involved vaginal brachytherapy and/or external beam radiotherapy (EBRT) and/or chemotherapy. Clinical follow-up consisted of physical examinations and the use of imaging techniques depending on the findings. Follow-up sessions were conducted every 3 months during the first 2 years, every 6 months during the following 3 years and once a year thereafter.

RSS description

Five major RSS related to the risk stratification of early-stage EC were identified in the medical literature using PubMed: the Post Operative Radiation Therapy in Endometrial Carcinoma (PORTEC)-1 classification (Creutzberg et al, 2000a), the Gynecologic Oncology Group (GOG) 99 classification (Keys et al, 2004), the Survival effect of para-aortic lymphadenectomy (SEPAL) in EC classification (Todo et al, 2010), the ESMO (Colombo et al, 2013) and ESMO-modified (Bendifallah et al, 2014) classifications. RSS were selected with respect to their acceptance in the literature and clinical applicability. Table 1 describes the criteria for each RSS.

Table 1 Description of five risk recurrence systems

Recurrence events and recurrence-free survival (RFS)

The clinical end point was recurrence. Disease recurrence was diagnosed by biopsy or imaging studies and defined as a relapse without differentiating between their local or distant nature. RFS was defined as the time from surgery to the date of recurrence. Estimates were produced using the Kaplan–Meier method.

Statistical analysis

Stratification accuracy

The receiver operating characteristic area under the curve (ROC-AUC) as well as the concordance index (C-index) indicate the discriminatory properties and quantify the stratification accuracy (i.e., whether the relative ranking of individual stratification was in the correct order) (Hanley and McNeil, 1982; Heagerty et al, 2000; Heagerty and Zheng, 2005). The AUC requires binary outcomes (presence or absence of the event) and is reserved for binary logistic regression models. The c-index represents an adaptation of the AUC for censored data and is necessary when time-to-event data are used. In the current analysis, the accuracy of each RSS for RFS (censored data) was conducted using the Cox Proportional Hazards Model. Similarly to quantify the discriminatory properties of each RSS with regard to the risk for LNM, a binary logistic regression model was performed. The AUC, as well as the c-index of 0.5, represents no discriminating ability, and a value of 1.0 represents perfect discrimination.

RSS diagnostic accuracy

Sensitivity, specificity, negative predictive values, positive predictive values and the overall diagnostic accuracy (ODA) (i.e., the probability of a patient being correctly classified by the RSS) with 95% CI were calculated to study the diagnostic ability of each RSS to classify patients at low risk and those at high risk of recurrence and nodal metastases.

Others analysis

Statistical analysis was based on Student's t-test and the Mann–Whitney test for parametric and nonparametric continuous variables, respectively, and the χ2-test or Fisher's exact test, as appropriate, for categorical variables. Values of P<0.05 were considered to denote significant differences. Data were managed with an Excel database (Microsoft, Redmond, WA, USA) and analysed using R 2.15 software, available online.

Results

Characteristics of the study population

During the study period 553 patients with EC were documented as having received primary surgical treatment according to the following distribution: Dijon Cancer Center (n=122; 22%), Creteil Hospital (n=83; 15%), Reims University Hospital (n=87; 16%), Tenon University Hospital (n=70; 13%), Jeanne de Flandre University Hospital (n=97; 17%) and Senti-Endo trial (n=94; 17%). The demographics and clinicopathological characteristics of the whole cohort are reported in Table 2. The median age of the patients was 65.0 years (range: 31–98 years).

Table 2 Characteristics of the whole population

RFS according to each RSS

The median follow-up was 32 (range: 2–165) months and the median time to initial recurrence was 29 (range: 1–165) months. Overall 3-year RFS and 3-year recurrence rates were 83.9% (95% CI, 80.6–87.4) and 16.4%, respectively. Loco-regional, nodal and distant recurrences were observed in 20% (18/91), 24% (22/91) and 56% (51/91) of cases, respectively. The respective 3-year RFS according to each RSS are reported in Figure 1.

Figure 1
figure 1

Discrimination of each RSS for recurrence and nodal metastases.

Discrimination and diagnostic accuracy of each RSS system for recurrence

The discrimination of each RSS is reported in Figure 2A. The RSS with the highest discrimination was the ESMO-modified classification (C-index=0.73 (95% CI, 0.70–0.76)). The diagnostic accuracy of each RSS is reported in Table 3. The RSS with the highest ODA to select patients at low risk of recurrence was the PORTEC-1 classification with 56% of patients correctly stratified. The RSS with the highest ODA to select patients at increased risk of recurrence was the ESMO-modified classification with 78% of patients correctly assigned.

Figure 2
figure 2

Recurrence-free survival curves according to each risk classification.

Table 3 Diagnostic accuracy for recurrence

Discrimination and diagnostic accuracy of each RSS systems for nodal metastases

Overall, 86.6% (479/553) of the patients underwent systematic nodal staging and 16.5% (79/479) of these had nodal metastases (Table 2). Discrimination of each RSS is reported in Figure 2B. The RSS with the highest discrimination was the ESMO-modified classification (AUC=0.80 (95% CI, 0.78–0.82)). The diagnostic accuracy of each RSS is reported in Table 3. The RSS with the highest ODA to select patients at low risk of nodal metastases was the PORTEC-1 classification with 56% of patients correctly stratified. The RSS with the highest ODA to select patients at increased risk of metastases was the ESMO-modified system with 77% of patients correctly assigned.

Discussion

To our knowledge, this is the first study to provide a comparison of five major RSS applied to a multicenter population with early-stage EC. The results suggest that these five RSS have a poor-to-moderate discrimination for recurrence and nodal metastases. In addition, the clinical diagnostic accuracy to distinguish subgroups of patients at low- and high-risk of recurrence or nodal metastases appears to be limited and heterogeneous.

Management of women with early-stage EC remains controversial and practice patterns vary widely among gynecologic oncologists (Creutzberg et al, 2000a; Keys et al, 2004; ASTEC study group et al, 2009; Nout et al, 2010; Colombo et al, 2013; Ko et al, 2013). This is mainly because there are several criteria defining risk groups for recurrence, unstandardised protocols for surgical staging and different indications for adjuvant therapies (Creutzberg et al, 2000a; Keys et al, 2004; ASTEC study group et al, 2009; Nout et al, 2010; Colombo et al, 2013; Ko et al, 2013). To overcome these limitations and guide clinicians in their decision-making and in providing patient information, several authors have developed RSS to create a common nomenclature (Creutzberg et al, 2000a; Keys et al, 2004; Creasman et al, 2006; Querleu et al, 2011; Colombo et al, 2013; Bendifallah et al, 2014). Although all of these RSS include similar variables, the combination of variables differs substantially between the United States and European countries leading to widely differing practice patterns for adjuvant therapies and indications for nodal staging (Creutzberg et al, 2000a; Keys et al, 2004; ASTEC study group et al, 2009; Nout et al, 2010; Colombo et al, 2013; Ko et al, 2013). The potential of ROC curves in medical diagnostic testing was recognised as early as 1960 (LUSTED, 1960) as the most relevant statistical tool to describe diagnostic performance (Hanley and McNeil, 1982; DeLong et al, 1988). Classically, the predictive accuracy of a classification is based on the assumption that all patients within a given risk group are equal. However, in practice, heterogeneity in both biological parameters and patients’ characteristics within each risk subgroup has been reported, especially for women with early-stage EC (Creutzberg et al, 2000a; Keys et al, 2004; Ballester et al, 2011, 2013; Nugent et al, 2012), leading to incorrect risk assignment. Our results confirm that the ESMO-modified classification (Bendifallah et al, 2014) was the RSS with the highest discrimination according to recurrence with a C-index of 0.72. We also found that the PORTEC-1 classification (Creutzberg et al, 2000a) was the most accurate in selecting patients at low risk of recurrence with an ODA of 55% and the ESMO-modified classification (Bendifallah et al, 2014) more accurate in selecting patients at increased risk with an ODA of 78%. These results also suggest that these RSS are heterogeneous in terms of classification performance. Moreover, it highlights the high rate of misclassified patients whatever the RSS used and the potential risk of inadequate surgical staging and over- or under-treatment. Finally, these results underline that new biological markers or stratification tools are probably needed to improve discrimination of such classifications, resulting in a more adapted surgical staging and adjuvant treatment.

Despite a reported good overall survival, almost 15% of patients with localised disease experience recurrence during the first 2 years following initial treatment (Creasman et al, 2006; Benedetti Panici et al, 2008; ASTEC study group et al, 2009; Bendifallah et al, 2014). It is therefore essential to distinguish patients at increased risk of recurrence who require systematic adjuvant EBRT and/or chemotherapy. A debate exists regarding the optimal adjuvant therapy for patients with early-stage EC. Published trials involve a wide variety of patients with different characteristics, rendering interpretation of the results somewhat difficult (Creutzberg et al, 2000a; Keys et al, 2004; Nout et al, 2010; Ko et al, 2013). Moreover, there are several differences in surgical staging from one study to another; in some trials, lymphadenectomy was systematically performed (Kuoppala et al, 2008; Reed et al, 2008; Susumu et al, 2008), whereas in others it was not required (Creutzberg et al, 2000a; Maggi et al, 2006; Randall et al, 2006; ASTEC/EN.5 Study Group et al, 2009; Nout et al, 2010) or performed only in case of suspicious lymph nodes (Morrow et al, 1990; Sorbe et al, 2009, 2012). This gives rise to an important confounding bias. Three randomised trials on adjuvant pelvic radiation versus a wait-and-see approach have shown significantly improved loco-regional control in case of additional EBRT, with no impact on overall survival (Aalders et al, 1980; Creutzberg et al, 2000a; Keys et al, 2004). Indeed, when focusing on the high-risk cohorts, the reported loco-regional recurrence rates vary from 13 to 23% with no adjuvant EBRT (Aalders et al, 1980; Creutzberg et al, 2000a; Keys et al, 2004) versus 5% when adjuvant EBRT is administered systematically (Aalders et al, 1980; Creutzberg et al, 2000a; Keys et al, 2004). This underlines the importance of accurate risk stratification in selecting the most adapted treatment option. Similarly, few data exist on the role of chemotherapy in early-stage EC. In high-risk EC, the Cochrane meta-analysis showed a positive impact of chemotherapy on overall survival, disease-free survival and distant metastasis (Johnson et al, 2011). However, these results may be biased by the inclusion of patients with more advanced disease once again rendering interpretation somewhat difficult (Randall et al, 2006).

Selecting patients who might benefit from systematic nodal staging is a major issue to guide postoperative treatment in patients with early-stage EC (Benedetti Panici et al, 2008; ASTEC study group et al, 2009; Ballester et al, 2011; Nugent et al, 2012). In this setting, a meta-analysis of two randomised trials on the impact of systematic lymphadenectomy in early-stage EC showed no benefit on overall and recurrence-free survival (Benedetti Panici et al, 2008; ASTEC study group et al, 2009). In contrast, in the SEPAL study Todo et al (2010) reported a survival benefit for systematic pelvic and para-aortic lymphadenectomy especially in patients with intermediate- and high-risk EC. These results highlight that the intermediate-risk group, as currently defined by the major classifications, is a heterogeneous group of patients in terms of nodal metastases rendering indications for complete surgical staging and adjuvant therapies somewhat blurred. Moreover, in a retrospective study on the rate of nodal metastases in clinical stage 1 type 1 EC according to the PORTEC 1 (Creutzberg et al, 2000a) and GOG-99 criteria (Keys et al, 2004) for high-intermediate risk patients Nugent et al (2012) reported that patients have substantial risk of nodal involvement and recurrence, suggesting that complete nodal staging is crucial for this subgroup. Our results confirm that the ESMO-modified classification has the highest discrimination for nodal metastases. Moreover, we found that the PORTEC-1 RSS (Creutzberg et al, 2000a) was the most accurate to select patients at low risk with an ODA of 56%, whereas the ESMO-modified RSS (Bendifallah et al, 2014) was the most accurate to select patients at high-risk with an ODA of 77%. These results underline the need in the future for precise quantification of the risk of nodal metastases using a complementary approach based on individualized prediction models such as nomograms (Bendifallah et al, 2012; AlHilli et al, 2013). In this specific setting, AlHilli et al (2013) developed two nomograms in patients with surgically treated stage I–IV endometrioid EC to predict the probability of lymph node metastases. However, the definition of an optimal threshold to decide whether to perform secondary lymphadenectomy is lacking. Finally, the authors did not focus on women with early-stage disease, which is the subgroup with the most discrepancies in terms of nodal metastases.

Some limitations of the present study deserve to be mentioned. First, it included patients treated for early-stage EC over a relatively long period. During the data collection period, modifications in staging modalities (FIGO classification (Pecorelli, 2009)) and surgical techniques (LN staging) were introduced. For example, SLNB was introduced and shown to be a possible first-line treatment for patients with early-stage EC. Indeed, Raimond et al (2014) recently demonstrated that SLN mapping and ultrastaging improved staging and made it possible to adapt adjuvant therapy to the risk of recurrence. Second, our cohort included patients from several centers and discrepancies in patient management might have affected our results in part. However, all included centers were regional referral centers applying the current French guidelines. Third, although the ESMO-modified classification seems to be associated to higher stratification accuracy, an external and independent validation study of the current results is needed. Fourth, although the multicentre nature of this study provides an overview of clinical practice during a long period, the overall survival analysis could not be performed. Finally, central pathology review was not available. However, dedicated pathologists from tertiary referral centers assessed all biopsies and specimens.

In conclusion, we demonstrate here that none of five major RSS shows high accuracy to stratify recurrence risk and nodal metastases in women with early-stage EC. Therefore, there is a need to revisit existing RSS using additional tools such as biological markers to better stratify patient risk in this setting. Moreover, several promising prognostic in situ biomarkers such as DNA ploidy, expression of P53, oestrogen and progesterone receptors have been identified (Ballester et al, 2013; Murali et al, 2014). These biomarkers could be used in clinical practice for a more individualised management in EC. At last, the therapeutic challenge for early-stage EC lies in promoting a personalized therapeutic strategy to avoid over- or under-treatment.