Main

Endometrial adenocarcinoma is a biologically diverse disease that has been divided into two subgroups based on histomorphology, biologic behavior, and underlying genetic aberrations. Type I tumors exhibit endometrioid morphology, arise in the setting of increased estrogen exposure with subsequent development of precancerous lesions (endometrial intraepithelial hyperplasia-EIN/atypical hyperplasia), are associated with frequent PTEN inactivation mutations, and have a favorable outcome.1, 2 By contrast, Type II tumors demonstrate papillary serous or clear cell morphology, are not associated with any known precursor lesions, demonstrate frequent p53 mutations, and are associated with a poor outcome.3, 4 The differences between these groups are summarized in Table 1. The observation that uterine papillary serous carcinomas have a greater tendency to metastasis and/or recurrence in the peritoneum underscores the importance of accurate diagnosis.5

Table 1 Pathogenetic subsets of endometrial carcinoma

Although the distinction between serous and endometrioid subgroups of endometrial cancer has a strong theoretical basis, subsets of endometrial adenocarcinoma have been recognized that do not fall cleanly into either of the two categories. Tumors with glandular architecture that more closely approximate endometrioid adenocarcinoma may occasionally demonstrate strong p53 positivity.6, 7 Alternatively, mixed pattern tumors exhibiting coexisting but immunophenotypically distinct serous and endometrioid patterns have also been encountered. Finally, recent reports have identified putative precursor lesions (endometrial glandular dysplasia) that are associated with uterine papillary serous carcinoma but are distinct from serous endometrial intraepithelial carcinoma.8, 9 Owing to the marked differences in outcome associated with serous morphology or p53 inactivation, the accurate assignment of tumor type has important clinical implications, both in terms of therapy and prognosis.10

The purpose of this study was to identify parameters that influence the accuracy and reproducibility of endometrial adenocarcinoma classification into prototypical Type I and II groups. Indications and expected results for p53 immunohistochemistry were compared for multiple observers with actual results. The ability of pathologists to consistently identify a mixed, or intermediate, class of lesions was directly measured, and the immunophenotype of this group examined.

Materials and methods

Case Selection and Review

In all 103 consecutive cases of endometrial adenocarcinoma were selected from the diagnostic files of Brigham and Women's Hospital for the period 1988–2002, including hysterectomies and endometrial biopsies. H&E slides were examined by three blinded independent reviewers (KL, MN, and GM) who were asked to score all cases using two classification systems:

Classification Systems

A classic ‘best fit’ diagnostic schema for classification employed a binary system where Type I corresponded to endometrioid adenocarcinoma and Type II to papillary serous carcinoma.

An expanded 3-class schema provided for an uncertain category in addition to Type I (endometrioid) and Type II (serous) tumors. Observers were permitted to select an indeterminate category, corresponding to patterns that were not easily categorized into either of the classic groups.

All cases identified by at least one pathologist as ‘Uncertain’ in the expanded schema were then resolved by consensus review into subgroups. These subgroups included specimens which were indeterminate due to technical reasons, carcinomas with discrete admixed or hybrid histologies, and other problems in classification.

Other Correlations

In addition to classifying the tumors into the aforementioned groups, the observers recorded their preference for performing immunostaining for p53 and were asked to predict the p53 immunostaining pattern.

Immunohistochemistry

All cases were immunostained for p53, results of which were scored by two other pathologists blinded to the previous histologic interpretations:

P53-negative

This group had <50% positive staining nuclei (Figure 1a).

Figure 1
figure 1

Patterns of immunostaining for p53. (a) Heterogeneous staining involving less than 50% (typically 0–15%) of tumor cell nuclei. (b) Homogeneous staining of over 50% of tumor cell nuclei. (c) Heterogeneous (upper) juxtaposed with homogeneous (lower) staining.

P53-positive

This group required one of the following: (1) 50% or more nuclei staining positive (Figure 1b), or (2) Two discrete geographic patterns in which one stained with 50% or more positive nuclei (Figure 1c).

Statistics

Concordance in histologic diagnosis between observers or between observers and immunohistochemistry were assessed using the kappa statistic. Values of k were interpreted as poor to fair (less than 0.4) good (0.4–0.55), very good (0.55–0.7) and excellent (greater than 0.7). The following correlations were made and are illustrated in the results in dendrograms signifying a hierarchical clustering algorithm using Ward's linkages in Euclidian space:

  1. 1)

    Observer agreement in classifying endometrial adenocarcinoma under a two or three category system.

  2. 2)

    Observer agreement in ordering p53 immunostaining and predicting p53 immunostaining results

  3. 3)

    Relationship between a diagnosis of ‘indeterminate’, interobserver agreement and p53 immunostaining.

Each dendrogram depicts all of the cases individually as rows, and the interpretation of each reviewer in columns. When observers agree on a given interpretation for a specific case, the line color is continuous across all three columns. When only one or two observers agree, the line is interrupted by a different color in the column of the non-concordant observer for that case. In each analysis, cases are reordered to arrange those with unanimous, majority and minority agreement into clusters. The latter device provides a rapid visual assessment of the level of concordance across observers (Figures 2 and 3).

Figure 2
figure 2

Examples of cases for which the reviewers achieved unanimous agreement of Type I (a) and Type II (b) carcinomas. Example of a case with non-unanimous agreement (c). Dendrograms depicting clustering of observer diagnoses (columns) by cases (rows) for a binary system (d) and three class system (e). Clustering of observer decisions regarding the need for p53 immunostaining is also depicted (f) (see Materials and methods).

Figure 3
figure 3

Case examples with either non-unanimous agreement or re-classified as indeterminate by one or more observers, with p53 immunostaining. (a and b) Cases with endometrioid growth pattern and prominent nuclear atypia, both p53-positive (insets). (c) A biphasic pattern with both endometrioid (upper) and serous (lower) differentiation, the latter positive for p53 (inset). (d) Papillary architecture in a tumor, presumably endometrioid, p53 negative (inset). (e) A dendrogram compares staining predictions of the three observers with the actual staining interpretation (columns), for each case (rows).

Results

Interobserver Agreement of Lesion Classification

Figure 2 displays cases with uniform agreement for Type I and II adenocarcinoma (Figures 2a and b) and non-uniform agreement (Figure 2c) under a binary classification system. Using a binary classification system, reviewers achieved very good to excellent interobserver reproducibility (k=0.71, Figure 2d). All three observers agreed on the diagnosis of Type I or Type II in 84 and 72 percent of cases respectively (Table 2). Examples of cases unanimously classified as Type I and II are illustrated in Figure 2a and b.

Table 2 Diagnostic concordance and p53 outcomes by diagnostic schema

When the expanded (3-group) classification system was used by the reviewers, concordance was good but the reduction in concordance relative to the binary system reflected variability in re-classification of cases as indeterminate (overall k=0.49, Figure 2e). Two or more observers agreed on the diagnosis of Type I or Type II in 78 and 83 per cent of cases (Table 2). Of the 44 cases receiving a score (by at least one observer) of ‘indeterminate’, two or more observers agreed on this diagnosis in 34%. Examples of cases in this category are illustrated in Figures 3a–d.

Classification and p53 Immunostaining

There was poor interobserver concordance for the selection of p53 immunostaining, reflecting different thresholds of observer commitment to this test (k=0.28, Figure 2f). However, in terms of p53 staining outcome, the proportion of cases scoring positive for p53 closely paralleled the certainty of the diagnosis (Table 2, Figure 3). Moreover, the p53-staining outcome paralleled the expectations of the observers, irrespective of whether the stain would have been ordered. When an expanded classification was used, only 3% of cases with a unanimous diagnosis of Type I scored p53-positive. This contrasts with 88% of cases with a unanimous diagnosis of Type II. Depending on the observer, 19 to 21 percent of cases were re-classified as indeterminate using the expanded system (Table 2) and from 50 to 80 per cent of these cases scored positive for p53 (Table 2). In 15 cases, at least two of three observers were concordant in assignment to the uncertain class. Examples of indeterminate cases scoring p53-positive are illustrated in Figure 3.

Resolution of Cases Classified as Indeterminate

Overall, in 44 cases, 1/3 (29) 2/3 (13) or 3/3 (2) observers reclassified cases into a third category. The most common reasons were overlapping histologic characteristics. These are illustrated in Figure 3a–d and the correlation of p53 staining with expectations is summarized in Figure 3e. Cases in the indeterminate category included those with discordances in cytohistologic architecture, in which an endometrioid growth pattern was accompanied by a higher than expected nuclear grade (Figure 3a and b), a perception of two discrete histologic patterns (Figure 3c) and villoglandular architecture that prompted a diagnosis of indeterminate (Figure 3d). Others included diagnostic uncertainty due to stated difficulties in classification, sampling and preservation.

Discussion

This study has demonstrated that a binary classification system of endometrial adenocarcinomas permits recognition of the two categories with excellent reproducibility. However, it also has shown that a minority of these tumors fall into a category that is not easily classified into either of the two, and that approximately one-third of these cases will pose problems for a majority (two of three) of observers. Indeterminate or intermediate histologic features, which resulted in assignment to the uncertain category included the following (Figure 3): (1) Mixed histotypes: some serous carcinomas are heterogeneous in their histologic appearance, being composed in part of patterns that are indistinguishable from endometrioid differentiation. This group includes tumors that are homogeneously positive for p53 (ie serous carcinoma with endometrioid pattern) and tumors that are a bona fide hybrid of the two, in which p53-negative (endometrioid) and p53-positive (serous) components co-exist. (2) Cytologic-architectural discrepancy: some tumors with endometrioid architecture display an usually high degree of nuclear atypia and p53 immunopositivity. This is being increasingly recognized as a probable subset of serous carcinomas.6 (3) Similarity of villoglandular and papillary architecture: endometrioid tumors with a prominent villo-papillary architecture may superficially resemble the papillary pattern of Type II carcinomas, but are homogeneously p53-negative. (4) Sample adequacy: accurate evaluation may be compromised by fragmentation or scanty sampling making the case uninterpretable.

There is abundant literature supporting the poor prognosis of the serous (Type II) phenotype and the significance of p53 immunostaining as a helpful prognostic indicator.11 The relative roles of both nuclear atypia and p53 positivity as independent (of each other) prognostic markers is controversial, due in part to the fact that the two are so closely linked to serous carcinoma.12, 13 However, investigators have verified the existence of p53-positive tumors with an endometrioid pattern, and in one study, p53 emerged as an important prognostic marker in this subset.14, 15 In view of the prognostic significance of p53 immunostaining with outcome, this marker has been proposed by some as a useful adjunct in the classification of selected endometrial adenocarcinomas.

Based on this study, a proportion of endometrial carcinomas can be expected to pose a problem in classification for a practicing pathologist, or be classified discordantly by two or more pathologists. The strength of association (or lack of) with p53, as shown in Table 2, increases as a function of agreement between observers, particularly in cases classified as endometrioid (Type I) and serous (Type II) carcinomas in a 3-tier system. Likewise, cases classified as indeterminate, particularly when two or more pathologists agree, carry a high rate of p53 positivity (Tables 2 and 3). Because the classification of a tumor as grade 2 or higher—either pre- or intra-operatively—might influence the decision to perform regional lymph node dissection, a high level of interobserver agreement is germane to consistent management. Whether p53 immunostaining of difficult cases will identify tumors at higher stage is less clear.16 Because the number of misclassifications is kept to a minimum when cases are reviewed by at least two observers, the case can be made for requiring review of endometrial malignancies by a second pathologist if the tumor cannot be readily classified as Type I. p53 immunostaining should be considered if the two observer diagnoses are not concordant.