Main

Neuroendocrine neoplasia encompasses a wide variety of organ sites, tumor grades, and clinicopathogical behaviors. Neuroendocrine differentiation can be recognized by morphological features along with immunohistochemistry (IHC). However, identification of the tumor site of origin in certain clinical contexts may be exceedingly challenging, due to the significant morphological and immunophenotypic similarity within this family of tumors. Clinical situations that may cause diagnostic subtyping challenges include: multiple possible primary tumors (in the context of Multiple Endocrine Neoplasia syndromes), widespread metastatic disease, lack of tumor-expressing site-specific serological markers and small biopsies with insufficient clinical data. The site of origin for neuroendocrine carcinoma has become increasingly important for grading/staging purposes,1, 2 for new clinical management guidelines,3, 4, 5, 6, 7 and for primary site-specific targeted therapy.8, 9, 10 When a primary site cannot be identified, tumors generally are treated according to the presumed aggressiveness of the tumor, as determined by a combination of the tumor grade11, 12, 13, 14 and available clinical and radiographic information about tumor metabolism.14

Because neuroendocrine carcinoma subtyping for site of origin is critical for clinical management, research has been dedicated to finding site and subtype-specific diagnostic markers.15, 16, 17, 18 Several biomarkers have been utilized to determine tumor subtype. eg, calcitonin is expressed by a majority of medullary thyroid carcinomas, and CK20 is characteristically expressed in Merkel cell carcinoma.15, 16 Some authors have proposed a panel of immunohistochemical stains (CDX-2, PDX-1, NESP-55, TTF-1, PAX8) to distinguish between differentiated gastrointestinal, pancreatic, and pulmonary neuroendocrine carcinoma with some success in tumor specificity;17, 18 however, this approach has seen limited utility in clinical practice due to relatively low sensitivities. Functional neuroendocrine carcinoma may have characteristic serology results and clinical presentation (eg, gastrinoma, insulinoma, etc.). Despite extensive clinicopathological investigation, however, it is estimated that up to 10% of patients with differentiated neuroendocrine carcinoma will have an unknown primary site.14

Traditionally, pathologists have used panels of protein-based immunohistochemical stains for tumor classification, but the potential advantage of molecular tests is readily apparent. In recent years, RNA expression-based classification of tumors has become an attractive standardized approach to aiding in the diagnosis and subclassification of tumors19, 20, 21, 22, 23, 24, 25 and for individualized therapy applications.26, 27, 28 In the coming years, gene expression-based analysis may serve as a complementary approach to genome-wide DNA analysis of tumors for individualized therapy.

A cohort of tumors with neuroendocrine differentiation was previously analyzed during a large-scale validation study of the 92-gene assay,20 an RT-PCR-based molecular cancer classifier. The larger validation study was designed to assess the overall performance of the 92-gene assay across 90 different tumor morphologies, comprising 28 different tumor types and 50 subtypes. The aim of the current study is to report detailed results on the performance of the 92-gene molecular classifier in subtyping neuroendocrine tumors and further highlight its potential diagnostic utility. In addition, exploratory analyses were conducted to examine and identify subsets of the 92-gene panel for specific use in neuroendocrine classification.

Materials and methods

Case Selection and Adjudication

Case selection criteria and methodology for the larger 92-gene classifier validation trial have been reported elsewhere; this trial included 1017 tumors from 28 tumor types and over 50 tumor subtypes.20 For the current study, we included all tumors (n=75) from the original trial that were considered to have neuroendocrine differentiation, including: Merkel cell carcinoma, medullary thyroid carcinoma, pheochromocytoma, paraganglioma, pulmonary neuroendocrine carcinoma (carcinoid, small cell carcinoma, large cell neuroendocrine carcinoma), pancreatic neuroendocrine carcinoma (all grades), and gastrointestinal neuroendocrine carcinoma (all grades; stomach, small intestine, appendix, and colorectum). Both primary and metastatic cases were included. Excluded were some sites of ‘epithelial’29 neuroendocrine tumors (thymus, pituitary, kidney, bladder, cervix, ovary), carcinomas with occult/mixed neuroendocrine differentiation, and most of the rarer “neural”29 types of neuroendocrine tumors (neuroblastoma, olfactory neuroblastoma, central nervous system primitive neuroectodermal tumors). Each case had been reviewed for diagnostic accuracy by consensus of two pathologists at different institutions. Case adjudication was performed by a primary pathologist through evaluation of clinical glass slides and available medical records and by a second pathologist who viewed a selected slide(s) by online whole slide digital imaging (Spectrum and ImageScope, Aperio Technologies, Vista, CA, USA) with clinicopathological information provided by the originating pathologist. Only adjudicated cases in which pathologists at both institutions agreed upon a consensus diagnosis for tumor type and subtype were included in the study (K=1). Cases were graded according to the grading criteria for each subtype as outlined in Klimstra et al1 and Hochwald et al30 using mitotic rate and tumor necrosis as applicable. These cases were selected and analyzed before the recent inclusion of Ki-67 in neuroendocrine tumor grading schemes, and thus this marker was not performed. Merkel cell carcinomas were considered grade 3. Grade 1 and 2 tumors were considered to be well-differentiated tumors, while grade 3 tumors were considered to be poorly differentiated. Medullary carcinomas and pheochromocytomas/paragangliomas were not graded.

Molecular Testing with the 92-Gene Assay

After passing diagnostic adjudication, a selected formalin-fixed, paraffin-embedded block was sectioned in RNAase-free conditions to produce one hematoxylin and eosin-stained section and three unstained 7-μ sections for molecular testing. The freshly prepared slides included only a research ID and were shipped to bioTheranostics (San Diego, CA, USA) for testing with only information on patient gender and biopsy site. Samples were macrodissected using the H&E-stained template or laser capture microdissected for tumor enrichment. Tumor enrichment after manual microdissection and laser-guided microdissection were >80% and >90%, respectively. Total RNA was extracted and DNase treated. First-strand cDNA was synthesized and then was pre-amplified (PreAmp, Life Technologies, Carlsbad, CA, USA). Real-time PCR was then performed using an ABI 7900HT instrument quantitatively measuring the expression of 87 tumor-associated genes and 5 reference genes as previously described21 and quality control parameters applied.20 The raw quantitative data was compared with a reference set of 2094 tumors (representing 28 main tumor types and 50 histological subtypes) for prediction of tumor type and subtype by proprietary statistical algorithm. Neuroendocrine tumor types/subtypes in the 92-gene assay panel include Adrenal—pheochromocytoma/paraganglioma, Neuroendocrine—skin (Merkel cell carcinoma), Neuroendocrine—lung low grade (pulmonary carcinoid), Neuroendocrine—lung high grade (pulmonary small cell carcinoma or large cell neuroendocrine carcinoma), Neuroendocrine—intestine (neuroendocrine tumors of all grades from the alimentary tract), Neuroendocrine—pancreas (pancreatic endocrine tumors), and Neuroendocrine—thyroid (medullary thyroid carcinoma). Quality control parameters for the 92-gene assay were described previously.20

Analysis of 92-Gene Panel Subsets

To select a gene subset for typing of neuroendocrine tumors, receiver operating characteristic analysis was performed for each of the 87 tumor-associated genes using 2094 tumors from the 92-gene assay reference database to assess their discriminatory power to differentiate neuroendocrine tumors (N=290) from non-neuroendocrine tumors (N=1804). Genes with the highest area under curve were chosen, and their performance in 957 cases from a blinded validation study was examined.20 The 75 neuroendocrine cases validated in this study do not overlap at all with the 290 cases from the training set in the reference database.

To identify a gene subset for subtyping neuroendocrine tumors, analysis of variance (ANOVA) was conducted for each of 87 genes using the 290 neuroendocrine tumors in the reference set. Genes with smallest P values were the ones that best distinguish the subtypes of neuroendocrine tumors and were thus selected as candidates for subtyping. The performance of the selected genes in the 75 neuroendocrine tumors from the validation study cohort was assessed by principal component analysis (PCA) and visualized in a three-dimensional plot using the first three principal components to examine the separation of different neuroendocrine subtypes.

Results

Performance Characteristics of the 92-Gene Classifier in Neuroendocrine Tumors

The performance of the 92-gene classifier in accurately classifying 28 tumor types and 50 subtypes has been reported.20 All 75 neuroendocrine tumors in this study met quality control parameters and were classified by the assay. The cohort included 44 females and 31 males, with a mean age of 62 years (range 29–86). Tumor characteristics are provided in Table 1. Cases were comprised of 59% metastatic tumors and 41% primary tumors. The most common biopsy site was liver, followed by lung and lymph node (Figure 1). The performance characteristics for the 92-gene assay predictions of neuroendocrine subtype are shown in Table 2. Assay sensitivities were 99% (95% CI: 0.93–0.99) for accurate classification of neuroendocrine tumors and 95% (95% CI:0.87–0.98) for identification of tumor subtype for the site of origin. Positive predictive values ranged from 0.83 to 1.00 for individual subtypes. A confusion matrix comparing the reference diagnosis with the 92-assay results is shown in Table 3; this highlights areas of concordance and discordance between the 92-gene classifier subtyped cases and reference diagnosis. The concordance rate of the molecular results with the reference diagnoses in poorly differentiated neuroendocrine carcinoma (grade 3 tumors) was 87% (20/23), whereas for well-differentiated neuroendocrine carcinoma (grade 1 and 2 tumors from the gastrointestinal tract, pancreas, or lung) it was 97% (30/31, see examples in Figure 2).

Table 1 Case characteristics
Figure 1
figure 1

Distribution of biopsy sites. The height of the bars represents the number of cases included from each biopsy site. Dark gray bars indicate metastatic tumors to the sample site and light gray bars indicate primary tumors at the sample site. The most common sites were liver metastases and lung primaries followed by lymph node metastases.

Table 2 Performance characteristics of the 92-gene classifier for neuroendocrine tumor subtyping
Table 3 Matrix demonstrating the relationship between reference diagnosis and 92-gene assay prediction
Figure 2
figure 2

Examples of well-differentiated neuroendocrine carcinomas classified correctly by the 92-gene assay, H&E, × 200. Top: Primary pulmonary carcinoid, prediction strength 96%. Middle: Pancreatic endocrine tumor metastatic to liver, prediction strength 96%. Bottom: Ileal carcinoid metastatic to the liver, prediction strength 96%.

Four cases had discordant 92-gene assay predictions compared with the reference diagnosis (Table 4). Three of the four cases were correctly predicted as neuroendocrine carcinoma but were discordant at the subtype (site of origin) level. Case 1 was adjudicated as an endobronchial pulmonary well-differentiated neuroendocrine (carcinoid) tumor with liver metastases at the time of primary diagnosis that was predicted by the assay to be a pancreatic endocrine primary. Case 2 was a pulmonary small cell carcinoma predicted to be Merkel cell carcinoma. Case 3 was a poorly differentiated pancreatic neuroendocrine carcinoma predicted to be a Merkel cell carcinoma. Case 4 was adjudicated as a poorly differentiated pancreatic neuroendocrine carcinoma and predicted to be a non-seminomatous germ cell tumor; however, a neuroendocrine tumor type was not ruled out by the assay (data not shown).

Table 4 Investigation of cases with discordant 92-gene assay results

Exploratory Analysis of Neuroendocrine Gene Subsets

Further analysis was explored to potentially define a smaller subset of genes within the 92-gene assay panel with high sensitivities and specificities for neuroendocrine classification and subtyping. Of note, the genes encoding synaptophysin and chromogranin are not part of the 92-gene assay panel.21 Four genes demonstrated high discriminatory ability for distinguishing neuroendocrine from non-neuroendocrine tumor types in the assay reference set (N=2094), based on an area under curve cutoff of ≥0.8 from the receiver operating characteristic analysis. Consistently, area under curve values for these 4 genes were >0.8 in the 957 cases from the validation study cohort (Table 5). Biomarker utility for discrimination of neuroendocrine from non-neuroendocrine tumors can also be seen in the stripchart plots showing the distribution of expression values of each gene in each of the subtypes of neuroendocrine cases, as well as in non-neuroendocrine cases in the validation cohort (Figure 3).

Table 5 Gene symbol, alternate names, chromosome location, and proposed/known function for genes differentially expressed in neuroendocrine tumors versus other tumor types
Figure 3
figure 3

Stripchart plots of the four selected genes with area under the curve >0.8. The plots are showing the distribution of CT values of each gene in each subtype of NE cases as well as in non-NE cases in the validation cohort.

The top 15 genes with significant P values from ANOVA analysis were selected as candidate genes to best distinguish different subtypes of neuroendocrine tumors in the reference set. These genes are described in more detail in Table 6 and include KIF2C,31, 32 SFTA3, CDCA3,33 KIF12,31, 34 CDH17,35, 36, 37, 38, 39 LOC100130899 (uncharacterized), NBLA00301,40, 41 HOXD11,42, 43, 44, 45, 46 EPS8L3, IRX3,47, 48 WWC1,49, 50, 51, 52, 53 HOXB8,54, 55, 56 FOXG1,57, 58, 59, 60, 61, 62 BCL11B,63, 64, 65 and LOC100506088 (uncharacterized). To visualize how well these 15 genes can distinguish neuroendocrine subtypes in the validation cohort, PCA were performed and the first three principal components were used to produce a three-dimensional plot showing the unsupervised clustering pattern of the different neuroendocrine subtypes (Figure 4). The PCA plot shows distinct separation of each neuroendocrine subtype and provides preliminary evidence that neuroendocrine subtyping may be feasible through optimization of the collective expression of the 15-gene set.

Table 6 Gene symbol, alternate name, chromosome location, and proposed/known function for genes most useful for predicting neuroendocrine tumor subtypes
Figure 4
figure 4

Three-dimensional Principle Component Analysis (PCA) plot shows the clustering pattern of the cases of different neuroendocrine tumor subtypes.

Discussion

Results from this blinded study demonstrate excellent performance of a 92-gene expression-based molecular classifier for prediction of tumor site of origin in a heterogeneous cohort of both primary and metastatic neuroendocrine tumors. These findings are particularly noteworthy, because neuroendocrine cases included in the current study were part of a larger validation trial20 that was completely blinded to tumor diagnosis, and especially pertinent here, neuroendocrine differentiation.

The 97% accuracy of the 92-gene assay for well-differentiated neuroendocrine tumors reported here is superior to published findings using IHC panels.17, 18 All well-differentiated neuroendocrine tumors from the gastrointestinal tract (12/12) and pancreas (8/8) and 91% (10/11) of pulmonary well-differentiated neuroendocrine tumors were correctly classified for the site of origin in our study; this included both metastatic and primary tumors. Correct identification of primary site in the metastatic setting is important, as treatment options and prognosis differ for thoracic-, pancreatic-, and gastrointestinal tract-based neuroendocrine tumors.4, 6, 7, 8, 9, 10, 66 Sangoi et al17 showed that IHC for PAX8 had only a 65% sensitivity for identifying pancreatic origin in well-differentiated neuroendocrine tumors metastatic to the liver, although it was 100% specific. Several cases of primary gastrointestinal neuroendocrine tumors in this study expressed PAX8. Long et al66 found similar results, with positive staining for PAX8 in only 50% of pancreatic neuroendocrine tumors metastatic to the liver, and with positive staining of all duodenal, 85% of rectal, and approximately 20% of appendiceal and gastric primary neuroendocrine tumors. In this study, the only gastrointestinal tumors metastatic to the liver that were tested for PAX8 were ileal tumors, which never showed any positive staining for PAX8 in the primary tumors. Srivastava et al18 demonstrated that an IHC panel including CDX2, PDX-1, NESP-55, and TTF-1 had limited performance for accurately predicting the primary site of gastrointestinal and pulmonary primary tumors, although it showed a sensitivity and specificity of 97% and 91%, respectively, for predicting pancreatic origin. In poorly differentiated tumors, the 92-gene assay showed rare discordant cases, but even in these diagnostically challenging cases the assay displayed an excellent overall performance overall of 87%.

Numerous pre-analytic and analytic factors can affect the reproducibility of protein IHC between and within labs. Some of the same pre-analytic factors affect both mRNA and proteins, including cold ischemic time, fixation methods and timings, tissue processing, and storage methods.67 Although mRNA degradation has been seen as more of a concern for mRNA than protein, both targets have been successfully adapted for formalin-fixed, paraffin-embedded processing. Protein IHC has additional variation in antibody retrieval methods and antibody clone differences, as well as wide variation in techniques and controls used between labs; all these factors can all affect the rate of false positive and false negative IHC results.68, 69, 70, 71, 72, 73, 74 In addition, selection of specific IHC markers and the interpretation of the protein expression patterns can be subjective and is dependent on the individual pathologist.

The strength of molecular diagnostics for tumor classification, including the 92-gene assay, lies both in standardized testing methods and in the comparison of gene expression between tumor samples and a well-adjudicated and robust expression database, provided rigorous quality controls are built into the assay process to evaluate specimens for appropriate mRNA integrity. Real-time, quantitative PCR for measurement of RNA expression is a standardized, highly reproducible, multiplexed panel of expression markers but with a logarithmically extended dynamic range of gene expression measurement not available in protein IHC. Because the signals are not directly visualized on tumor tissue, however, this assay is most effectively used with careful guidance by a pathologist to ensure sample selection for enrichment of tumor and exclusion of interfering normal cells (lymphocytes, fibroblasts, etc), a process of which is already growing rapidly within laboratories performing molecular oncology testing. The 92-gene assay routinely includes laser microdissection of tumor tissue, which is a key step that contributes to classification accuracy.19, 20

Feasibility analysis was performed to determine whether a subset of genes within the 92-gene biomarker panel could have value in predicting the site of origin for neuroendocrine tumors. Specifically, 15 genes were identified that showed reasonable discrimination between neuroendocrine tumors from different anatomic sites in this set of tested tumor samples. As their initial discovery was part of a data-driven process looking at differential gene expression across a diverse and wide variety of tumor types,21 and not for neuroendocrine typing in particular, mechanistic links to neuroendocrine differentiation or specific neuroendocrine tumor types are currently unknown. Many of the genes have been implicated in embryonic development, particularly in neuronal development (Table 6). Some genes have been implicated in minimally differentiated/embryonic-type tumors such as neuroblastoma and acute leukemias. The demonstration that the 15-gene subset of the 92-assay panel can discriminate differentiating tumor subtypes that are seemingly closely related supports the strength of using a collective gene expression profile for tumor subclassification. Future research on the specific function of these genes may provide additional insight into neuroendocrine tumor pathogenesis.

A limitation of this study was that it is a post hoc analysis from a larger validation trial that was powered to classify tumors from a much wider variety of tumor types and subtypes. For this reason, rarer neuroendocrine tumor types (eg, ovarian carcinoid, olfactory neuroblastoma, tumors with mixed/occult neuroendocrine differentiation) were not included. Additionally, this subset analysis also did not account for the prevalence of tumor subtypes in clinical practice. Further extension of the classifier to answer the question of prevalence effect may be particularly important for prospective trials of clinical and diagnostic utility in metastatic neuroendocrine carcinoma of unknown primary.

Findings from this study showed that there were four discrepancies between the 92-gene assay prediction and reference diagnosis. Review of these cases supports the reference diagnosis in all four cases based upon the adjudicated clinicopathological evidence by two pathologists at different institutions. In such situations, there always remains a question whether some of these cases could be incorrectly classified by current clinicopathological criteria or whether misclassification is due to n aberrant gene expression in the tested tumor. Practically, incorporation of both clinicopathological and molecular data (standardized gene expression data and targeted mutational analysis) will be required in order to properly classify patients for treatment with standard or targeted chemotherapies.

In conclusion, the 92-gene assay showed excellent accuracy for identifying neuroendocrine carcinomas and the neuroendocrine tumor site of origin in both well-differentiated and poorly differentiated tumors. This novel finding may have an immediate clinical application for distinguishing metastatic pancreatic endocrine tumors from other well-differentiated neuroendocrine carcinomas given that there are therapies (tyrosine kinase and mTOR inhibitors) specifically approved for pancreatic tumors.8, 9, 10 In addition, a subset of 15 genes within the 92-gene assay panel showed promising discriminatory ability to subclassify the tumor site of origin of neuroendocrine tumors. Further investigation of these genes may have diagnostic and theranostic value.