Main

In the past ovarian carcinoma was regarded as to be a single entity with a spectrum of disease in which well-differentiated carcinoma progressed to poorly differentiated tumours (Kurman and Shih Ie, 2011). Moreover, histopathological ovarian carcinoma typing was afflicted with only modest reproducibility (Kobel et al, 2014). Treatment decisions were based solely on tumour stage (pre-operative tumour burden), post-operative tumour residuals, and grade. Novel therapeutic approaches to ovarian carcinoma were hindered by a poor understanding of the molecular heterogeneity of the disease, thus patients are today still treated according to a ‘one size fits all’ approach: tumour debulking surgery aiming at the complete resection of all tumour visible and cytotoxic chemotherapy based on platinum and taxanes. However, over the last two decades studies on ovarian carcinoma morphology, immunohistochemistry, and molecular genetics have led to the proposal that histotypes of ovarian carcinoma are best considered to be separate disease entities differing with respect to precursor lesions, molecular events during oncogenesis, pattern of spread, response to chemotherapy, and outcome (Kobel et al, 2008; McCluggage, 2011; Prat, 2012a). New insights into ovarian carcinoma pathology are driving change in ovarian carcinoma diagnoses and have recently served to modify the WHO Classification of Ovarian Tumors. According to the current 2014 classification epithelial ovarian cancer comprises five major histological types namely high-grade serous (HGSC), low-grade serous (LGSC), endometrioid (EC), clear cell (CCC), and mucinous ovarian carcinoma (MC) (Kurman et al, 2014). In contrast, the former 2003 WHO classification divided ovarian carcinoma into eight different histological types namely serous, EC, CCC, MC, transitional cell, squamous cell, mixed type, and undifferentiated ovarian carcinoma. Furthermore, HGSC and LGSC were not recognised as being separate histological types, but rather considered to represent different grades of one tumour entity (Tavassoli and Devilee, 2003).

Accurate histological typing taking into account recent diagnostic developments already has a key role in ovarian carcinoma research and will furthermore become indispensable clinically as there is a move towards histotype-specific treatment (McAlpine et al, 2009; Anglesio et al, 2012).

It was the aim of this study to assess how the evolution of histological ovarian carcinoma diagnostic criteria have influenced histotype diagnosis made by gynaecopathological experts and what were the likely clinical impact of such changes. In order to do so, a clinical trial cohort, which had undergone expert gynaecopathology case review in 2002, was now re-reviewed by the same pathologist (FK). Results were then compared with the results of independent review by another expert gynaecopathologist (BG). Furthermore, we hypothesised that revisiting historical survival analyses and biomarker studies using current diagnoses might lead to previously unexpected original new findings. Data from a previously published study showing independent prognostic significance of p16(INK4a) and pRb in advanced-stage ovarian carcinoma were recalculated (Kommoss et al, 2007).

Materials and methods

Tumour samples collected for translational research purposes from the AGO OVAR3 trial (prospective randomised, multi-centre, phase III trial comparing cisplatin+paclitaxel vs carboplatin+paclitaxel in FIGO IIB-IV ovarian carcinoma patients) formed the basis of the current study (du Bois et al, 2003). In 2002 central pathology review of a series of 334 study patients was performed by pathologist A (FK), established referral centre gynaecopathologist at that time. After exclusion of tumour samples insufficient for diagnosis of invasive ovarian carcinoma a series of 302 cases were available for further analysis. Tumours were typed according to the then current WHO criteria and were classified as either serous, MC, EC, CCC, transitional cell or undifferentiated ovarian carcinoma (Scully, 1999; Tavassoli and Devilee, 2003; Kommoss et al, 2005). Tumour grade was assessed according to the FIGO grading system (International Federation of Obstetrics and Gynecology, 1971).

Twelve years later all original slide material from the same cohort was re-reviewed by pathologist A, applying current diagnostic criteria. An independent review was performed by pathologist B (BG), another internationally recognised expert gynaecopathologist. Both pathologists were blinded to the original 2002, as well as to each other’s current diagnoses. According to the 2014 WHO criteria carcinomas were now typed as either HGSC, LGSC, EC, CCC, and MC (Kurman et al, 2014). Any discrepancies arising between current diagnoses of pathologist A and the evaluation of pathologist B were discussed on a case-by-case basis.

Kaplan Meier survival analysis based on original diagnoses of pathologist A was compared with that using current final consensus diagnoses of pathologists A and B. A historical biomarker study data set was retrieved and statistics were recalculated using the same consensus histotyping data (Kommoss et al, 2007).

Results

Upon retrieval of original study material from the AGO OVAR3 trial series, historical 2002 pathology review data and all pertaining original histological slides of 286 out of 302 cases were available. In comparison with the original review diagnoses, the distribution of histological types differed considerably both after re-review by pathologist A, as well as after independent review by pathologist B (Figure 1). Taking all histotypes together, pathologist A confirmed only 155 out of 286 (54%) of his specific histotype diagnoses. In the original review, 131 out of 286 (46%) of cases had been assigned to the serous type. At that time serous carcinomas were regarded as a single tumour entity of varying tumour grade (G1: 26 out of 131 (20%); G2: 71 out of 131 (54%); and G3: 34 out of 131 (26%). The distribution of non-serous carcinoma types was as follows: MC, 23 out of 286 (8%); EC, 52 out of 286 (18%); CCC, 35 out of 286 (12%); transitional cell, 14 out of 286 (5%); and undifferentiated, 31 out of 286 (11%). In keeping with current WHO criteria, tumours with serous differentiation were classified as either being of HGSC or LGSC type upon re-review by pathologist A, HGSC (229 out of 286; 80%) now being the most frequent histological type. The distribution of histological types among the remaining cases was as follows: LGSC, 21 out of 286 (7%); MC, 9 out of 286 (3%); EC, 14 out of 286 (5%); and CCC, 13 out of 286 (5%).

Figure 1
figure 1

Histological types in a series of 286 ovarian carcinomas after pathological review in 2002 (pathologist A 2002, left column) as compared with re-review applying current diagnostic criteria by the same reviewer (pathologist A 2014, middle column) and by another expert (pathologist B, right column).

Comparing pathologist A’s original and current histotype diagnoses, all cases with an original diagnosis of undifferentiated (n=31) and transitional cell carcinoma (n=14) were reclassified as HGSC. In the remaining cases, a diagnostic shift to different histotypes was observed in varying percentages, the most frequently found being serous (any grade)→HGSC (112 out of 131, 85%), EC→HGSC (40 out of 52, 77%), and CCC→HGSC (22 out of 35, 62%). Examples of cases of HGSC, based on 2014 review, that were previously diagnosed as CCC and endometrioid ovarian carcinoma are given in Figure 2.

Figure 2
figure 2

Examples of cases of HGSC, based on 2014 review, that were previously diagnosed as CCC and EC. (AC) The 2014 review HGSC, diagnosed as CCC in 2002. (A) Cells with clear cytoplasm, suggestive of CCC. (B) Typical architectural and cytological features of HGSC from elsewhere in this tumour. (C) Moderately intense diffuse WT1 immunopositivity. (DF) The 2014 review HGSC, diagnosed as EC in 2002. (D) Villoglandular architecture suggestive of endometrioid ovarian carcinoma. Note absence of squamous differentiation. (E) Typical architectural and cytological features of HGSC from elsewhere in this tumour. (F) Focal weak but convincing WT1 immunopositivity.

Subgroup analyses of those tumours diagnosed as serous carcinomas (any grade) in 2002 showed that all G3 (34 out of 34, 100%) and the vast majority of G2 tumours (70 out of 71, 99%) were diagnosed as HGSC. In contrast the G1 tumours were diagnosed on re-review as HGSC (8 out of 26, 23%) and LGSC (18 out of 26, 62%), respectively.

The relative frequency of histological types according to independent review by pathologist B was almost identical to that after pathologist A’s 2014 re-review: HGSC, 233 out of 286 (81%); LGSC, 21 out of 286 (7%); MC, 9 out of 286 (3%); EC, 10 out of 286 (4%); and CCC, 13 out of 286 (5%). For comparison of the historical 2002 review by pathologist A, his 2014 re-review and the 2014 independent review by pathologist B see Table 1.

Table 1 Distribution of ovarian carcinoma histotypes according to historical 2002 review, current 2014 re-review by pathologist A, and 2014 independent review by pathologist B

Comparing pathologist A’s and pathologist B’s 2014 diagnoses there was a concordance of histotype diagnosis in 280 out of 286 (98%) cases. Of the six discrepant cases, pathologist A diagnosed four as being of EC type (pathologist B: all HGSC), one as being HGSC (pathologist B: LGSC), and one as being LGSC (pathologist B: HGSC), as shown in Figure 1.

After online case discussion, based on scanned H&E sections and taking into consideration an additional WT1 immunostain in one case, a diagnostic consensus was reached in all six discrepant cases (HGSC, n=5; endometrioid, n=1).

Survival analyses based on pathologist A’s 2002 findings showed a significantly better survival for patients with a diagnosis of transitional cell carcinoma, with the other histotypes being of no prognostic significance. Applying 2014 diagnostic consensus data, MC and CCC showed a significantly worse outcome as compared with HGSC, LGSC, and EC (Figure 3).

Figure 3
figure 3

Survival by ovarian carcinoma histotype. (A) Survival using 2002 review diagnoses. Significantly better survival for TCC, the other histotypes of no prognostic significance. (B) Survival using 2014 consensus diagnoses. Significantly worse prognosis for MC and CCC as compared with HGSC, LGSC and ENDO.

Taking this new prognostic information into consideration the prognostic significance of p16 and pRb expression data were also reevaluated. Type-specific analyses of historical p16 and pRb expression data showed no significant correlation of 2002 original diagnoses and biomarker expression (Figure 4A and B). However, p16 negativity and high levels of pRb expression, both originally described as being indicative of an unfavourable outcome (any histotype), were found to be significantly more often present in CCC and MC (Figure 4C and D), with overall worse prognosis in pRb high cases. While previously published data showed a worse prognosis for p16 negative cases (all histotypes), p16 negativity now indicated a prognostic disadvantage only in the subgroup of CCC and MC, whereas in all other histotypes it conveyed no significant prognostic information (Figure 5).

Figure 4
figure 4

Biomarker expression by histotype. (A) p16 negative cases, 2002 diagnostic criteria. No significant correlation with histotype (P=0.09). (B) pRb high cases, 2002 diagnostic criteria. No significant correlation with histotype (P=0.4). (C) p16 negative cases, 2014 diagnostic criteria. CCC and MC significantly more often p16 negative (P=0.007). (D) pRb high cases, 2014 diagnostic criteria. CCC and MC significantly more often pRb high (P=0.009).

Figure 5
figure 5

Kaplan Meier survival estimates for p16 expression. (A) Published data applying 2002 diagnostic criteria, all histological types (n=286) (Kommoss et al, 2007). Significantly worse prognosis for p16 negative patients (P<0.001). (B) The 2014 review CCC and MC types only (n=22). Significantly worse prognosis for p16-negative patients (P=0.02). (C) The 2014 review HGSC, LGSC, and EC types only (n=264). p16 of no prognostic significance (P=0.17).

Discussion

The subclassification of ovarian carcinomas has been routinely based on histopathological features, specifically the resemblance of tumour cells to different cellular lineages characteristic of the mullerian tract, for example, fallopian tube, endometrium, or endocervix, as this classification system was formally proposed by the WHO in the 1970s (Poulsen et al, 1975). Shortly after the introduction of this classification scheme it was demonstrated that there was only moderate interobserver reproducibility in assignment of histotype, and there was also an appreciation that many cases showed considerable intratumoral heterogeneity in morphological appearance, resulting in many being diagnosed as mixed carcinoma, that is, consisting of an admixture of different histotypes (Mackenzie et al, 2015). The lack of reproducibility meant that histotype could not be used as a basis for patient management, while the frequent diagnosis of mixed carcinoma suggested that the histotypes were very closely related entities, often intermingling in an individual case, further evidence that went against considering histotype-specific patient management. The practice of considering ovarian carcinoma as a single disease with subtle morphological variants of little or no clinical significance was widespread until recently (Fletcher, 2013).

It was appreciated a decade ago that the mucinous and clear cell histotypes were associated with a less-favourable response to chemotherapy, and a concensus panel suggested that histotype-specific treatments should be developed for these histotypes (Fountain et al, 2006). At the same time, there was (1) development of histotype specific biomarkers, such as WT1, a marker of serous differentiation (Acs et al, 2004); (2) recognition based on morphology and clinicopathological correlation that HGSC and LGSC are distinct diseases (Singer et al, 2002; Malpica et al, 2004); and (3) increasing awareness that mixed carcinomas are uncommon, with many cases so diagnosed in the past simply being HGSCs with intratumoral morphological heterogeneity (Mackenzie et al, 2015); these developments led to demonstrable improvements in intraobserver reproducibility of ovarian carcinoma diagnosis. This is the first study, however, where both the shift of histotype diagnosis and the improvement in diagnostic reproducibility have been quantified, comparing diagnoses made previously, according to the criteria in use 15 years ago to current criteria. A well-defined set of cases from a clinical trial was used, with the same gynaecopathologist reviewing the cases 15 years later, allowing assessment of intraobserver diagnostic shift. As well, the histotype diagnoses rendered independently by two gynaecopathologists applying current diagnostic criteria, but working completely independently (different countries/continents, no common training bias/had never practiced in the same centre, or viewed cases together before this study) were compared. We have shown a very marked shift in diagnoses with only 54% intraobserver agreement in histotype diagnosis from 2002 to 2014. This reflects the profound effect over time new insights into the biology and morphology have had on the histopathological evaluation of ovarian carcinoma, as reflected in the current (WHO, 2014) classification system. In contrast, there is near perfect (98%) interobserver agreement in histotype diagnosis between two gynaecopathologists using current diagnostic criteria. This latter level of agreement exceeds the diagnostic accuracy demanded for predictive biomarkers such as HER2 testing, where 95% accuracy is required, according to the ASCO-CAP guidelines (Wolff et al, 2013), and places histotype diagnostic accuracy within the realm, where reproducibility is sufficient to allow for histotype specific management, such as placing patients into histotype-specific clinical trials.

The five main histotypes of ovarian carcinoma differ with respect to precursor lesions, molecular events during oncogenesis, genetic risk factors/association with hereditary cancer syndromes, patterns of spread, response to chemotherapy, and patient outcomes (Prat, 2012b). These differences were obscured in the past, when histotype was not accurately diagnosed. For example, within the clinical trial cases we report on, histotype only becomes of prognostic significance with the 2014 diagnoses, where MC and CCC are associated with a worse prognosis. These results show clearly the inadequacy of using historical diagnoses for research purposes, even when rendered by an experienced gynaecopathologist. This can also be seen in the results of the CIMBA study, where ovarian carcinoma histotype of more than 3000 ovarian carcinomas arising in patients with BRCA1 or BRCA2 mutations was reported (Mavaddat et al, 2012). The diagnoses were not based on re-review with use of current diagnostic criteria, and <70% of tumours were reported as being serous. In contrast, in case series with pathology re-review and use of current diagnostic criteria, almost all BRCA1 or BRCA2 associated ovarian carcinomas are high-grade serous type (Schrader et al, 2012). The impact of accurate histotype diagnosis on biomarker research is also demonstrated in our results; our previously reported association between pRB and p16 and prognosis (Kommoss et al, 2007) is seen to be largely attributable to correlation of expression of these markers with the unfavourable clear cell and mucinous histotypes, a finding obscured when the earlier, less-accurate histotype diagnoses were used. For biomarker studies to be interpretable it is critical that there be analysis with histotypes considered separately, and that histotype diagnoses be made based on current diagnostic criteria; historical histotype diagnoses are worse than useless, and can lead to incorrect conclusions. One can see prognostic significance when there is none, as for pRB or p16, or using histotype diagnosis important prognostic relationships may be obscured, as we demonstrated previously, that is, T-cell infiltrates are of prognostic significance in HGSC, but are not significant when all histotypes are considered together (Clarke et al, 2009). Cancer diagnoses have undergone a revolution over the past decade, with the ability to subdivide ‘common’ cancers into subtypes with distinct molecular profiles. This is perhaps best established for breast carcinoma, where it would be inconceivable to plan treatment without assessment of hormone receptor and HER2 status, but has also emerged for many other solid tumour types that, such as ovarian carcinoma, were formerly considered monolithic. For example, renal cell carcinoma can also be subdivided into histotypes based primarily on H&E assessment, with judicious use of immunohistochemistry and/or molecular testing in selected cases (Srigley et al, 2013). Ovarian carcinoma differs from breast but is similar to renal carcinomas in that a primarily morphological classification, with molecular testing as adjunct in selected cases, allows diagnosis of the molecular subtypes. There is no such thing as ‘ovarian carcinoma’ and we must look to generate new data defining the natural history, including response to treatment, of the ovarian carcinoma histotypes, as the older literature, where histotype diagnoses were based on historic criteria, or there was no attempt to separate cases based on histotype, is meaningless.

Our data demonstrate how much ovarian carcinoma histotype diagnosis has evolved over recent years, and begs the question of whether this trend will continue. We believe that the current histotypes will be stable into the future, and will not continue to change markedly. Our belief that the histotypes, as currently defined, are stable diagnostic categories, is based on the extensive molecular classification that has been done that supports them being distinct entities, as well as the corroborating clinical studies demonstrating, as seen in our results, differences in response to treatment and outcomes for the different histotypes. This has important implications; for example, it would be appropriate to revisit old data sets, in order to better characterise the histotypes with respect to response to treatment. In the future we can anticipate better understanding of the histotypes; while new prognostic and predictive markers will emerge, they will almost certainly prove to be histotype specific, and treatment will continue to evolve in the direction of histotype-specific therapies. The very reproducible diagnosis of histotype that we have shown supports histotype as a basis for ovarian carcinoma subclassification. This situation for ovarian carcinoma is different from that of EC, where histotype cannot be reproducibly diagnosed, or breast carcinoma, where 15 years after description of basal-like carcinomas they are not consistently diagnosed in routine practice; for both endometrial and breast carcinoma further refinement of histotype diagnosis can be anticipated. In contrast, for ovarian carcinoma we now have reproducibly diagnosable and molecularly distinct histotype and it is completely unacceptable to use historical histotype diagnoses for research purposes. It is time to start over, and our understanding of ovarian carcinomas must be based on accurate histotype diagnosis using current criteria.