Introduction

Pathology consultation cases can be classified as either extramural or referral-based. Extramural consults are cases sent for expert review, usually at the direction of the originating pathologist before the diagnosis is finalized (also termed “primary consults”). In contrast, referral-based consults are reviewed after a diagnosis at an outside facility has been rendered at the request of the treating institution prior to treatment (also termed “secondary consults”). Both consultation types represent resource-rich materials that allow insights into important diagnostic issues. The largest study of referral-based consultations was performed by Mayo Clinic Rochester in 2013. Investigators studied more than 70,000 consults, identified an overall disagreement rate of 0.6%, and reported that the most frequent area of disagreements was in gastrointestinal cases (17.5%) [1]. While their overall disagreement rate was low, disagreements resulted in changes to treatment in 90% and prognosis in 92.1% of cases. Other consultation studies have reported higher disagreement rates in the range of 1.4–10% [2,3,4,5,6], with similar high impact on treatment and prognosis. This study evaluated extramural consults of the appendix submitted to seven academic institutions between 2016–2017.

Materials and methods

A study of extramural consults of the tubular gastrointestinal tract, appendix, liver, and pancreatobiliary tree resulted in the collection of 1365 consecutive consults submitted to nine participants at seven academic centers between 2016–2017. The nine participants were all consultant pathologists and they recorded the following de-identified data for each consult: person directing the case for consultation (pathologist, clinician, or patient), consultation question(s), site, pertinent history (present or absent) / results of imaging studies (present or absent) / relevant laboratory results (present or absent), submitted diagnosis, ancillary studies performed, and consultant’s final diagnosis. All results were forwarded to MST who aggregated the consults by organ system: liver 40% (n = 541), upper gastrointestinal tract 29% (n = 390), lower gastrointestinal tract 21% (n = 284), appendix 3% (n = 43), pancreatobiliary tree 7% (n = 96), and miscellaneous 0.8% (n = 11). CAA scored all appendix consults and MST confirmed the scoring per the scoring strategy outlined in Table 1.

Table 1 Scoring strategy

Results

Overview of consults

All appendix consults were resections (100%, n = 43), and the majority were directed for consultation by the originating pathologist (95%, n = 41; clinician-directed = 5%, n = 2). Additional studies were usually not performed by the consultant (65%, n = 28). When additional studies were performed, they were most often H&E levels (60%, n = 9). Of 38 cases with submitted diagnoses, half (19/38) were major disagreements, and most of them (13/19) were downgraded (Fig. 1). Most consults were submitted for tumor classification. Invasion assessment and serrated polyp classification were also common consultation inquiries (Fig. 2). Overlooked diagnoses from the submitting pathologist included granulomata (n = 1) and endometriosis (n = 1). Outdated or incorrect terminology in the submitted diagnoses included the terms “carcinoid” instead of “well-differentiated neuroendocrine tumor” (n = 1), “mucinous cystic neoplasm” (n = 3) or “cystadenoma” (n = 1) instead of “low grade appendiceal mucinous neoplasm”, and “adenocarcinoid” instead of “goblet cell carcinoid with adenocarcinoma (adenocarcinoma ex-goblet cell carcinoid)” (n = 1). Of the two consults directed by clinicians, one was classified in agreement with the original pathologist’s diagnosis of “adenocarcinoma ex-goblet cell carcinoid” and the other was a major disagreement-downgrade from the original diagnosis of “adenocarcinoma” to the consultant’s diagnosis of “low grade appendiceal mucinous neoplasm”.

Fig. 1
figure 1

Most study cases were finalized as major disagreements and downgraded from the submitted diagnosis. *One case had both a major disagreement-upgrade and downgrade: this case was submitted as low grade appendiceal mucinous neoplasm and upgraded by consultant to high grade appendiceal mucinous neoplasm (major disagreement-upgraded) and a major disagreement-downgraded (submitted as T3 and downgraded by the consultant to Tis)

Fig. 2
figure 2

The majority of consults were submitted for tumor classification. Invasion assessment and serrated lesion classification were also common concerns

Low grade appendiceal mucinous neoplasm-related inquiries comprised the majority of consults

Of the 43 total appendix consults, 44% (n = 19) were submitted by the contributing pathologist as low grade appendiceal mucinous neoplasm, but only half of these were diagnosed by the consultant as low grade appendiceal mucinous neoplasm (47%, n = 9). Four consults submitted as low grade appendiceal mucinous neoplasm were scored as major disagreements-upgraded based on the consultant’s diagnosis of low grade appendiceal mucinous neoplasm with high-risk features (n = 2) and moderately differentiated mucinous adenocarcinoma (G2) (n = 2). Five consults submitted as low grade appendiceal mucinous neoplasm were scored as major disagreements-downgraded based on the consultant’s diagnosis of benign appendix (n = 1), appendicitis (n = 1), and serrated polyp (n = 3). One case had both a major disagreement-upgrade (submitted as a low grade appendiceal mucinous neoplasm and upgraded by consultant to high grade appendiceal mucinous neoplasm) and a major disagreement-downgrade (submitted as T3 and downgraded by the consultant to Tis). Next, we studied the submitting diagnoses for those consult cases diagnosed by the consultants as low grade appendiceal mucinous neoplasm (n = 15). Of these, the submitted diagnoses from the contributing pathologists included low grade appendiceal mucinous neoplasm (n = 9), adenocarcinoma (n = 5, one of these was complicated by an appendiceal diverticulum), and one case was submitted without a diagnosis. Figures 3 and 4.

Fig. 3
figure 3

Low grade appendiceal mucinous neoplasm versus serrated polyp versus adenocarcinoma was a common diagnostic issue. a Discriminating features of a low grade appendiceal mucinous neoplasm typically included a grossly distended appendix filled with abundant mucin. b, c This low grade appendiceal mucinous neoplasm was characterized by a thin wall (b), lamina propria obliteration (c), fibrosis of the submucosa and muscularis propria (c), and low-grade dysplasia of the mucinous epithelium (c). This case was submitted as possible low grade appendiceal mucinous neoplasm, diagnosed by the consultant as low grade appendiceal mucinous neoplasm, and classified as an agreement. d-f In contrast, an appendix involved by a serrated polyp showed normal caliber bowel wall thickness (d, e) and readily identifiable three bowel wall layers on low power (e). Intermediate power showed a preserved lamina propria, non-fibrotic submucosa (f), and serrated epithelium with easily identifiable goblet cells (f). This case was submitted as possible low grade appendiceal mucinous neoplasm, diagnosed by the consultant as serrated polyp, and classified as a major disagreement-downgraded. g Moderately-differentiated mucinous adenocarcinoma of the appendix demonstrated a complex gross configuration with solid and cystic components, hemorrhage and necrosis, and no grossly identifiable appendix. h, i The mucinous backdrop comprised more than 50% of the overall lesion. High grade features were seen with increased cellularity, complex glandular architecture, necrosis, prominent nucleoli, and loss of nuclear polarity. Stromal invasion with desmoplasia and individual cell infiltration were also seen (not shown). Signet ring cells were not identified. This case was submitted as possible low grade appendiceal mucinous neoplasm, diagnosed by the consultant as moderately-differentiated mucinous adenocarcinoma (G2), and classified as a major disagreement-upgraded

Fig. 4
figure 4

Common low grade appendiceal mucinous neoplasm consultation themes. a, b Appendices involved by diverticular disease were diagnostic confounding issues. a Helpful diagnostic clues of a low grade appendiceal mucinous neoplasm included obliteration of the lamina propria and a fibrotic wall on low power, and low grade dysplasia of mucinous epithelium on intermediate power (not shown). b In contrast, diverticula retained the lamina propria and lacked dysplasia. Others have reported additional helpful features of diverticular disease include mucosal neuromas and epithelial disarray [36]. c A high grade appendiceal mucinous neoplasm was missed in one case. A high grade appendiceal mucinous neoplasm was defined as mucinous epithelium with all the architectural features of a low grade appendiceal mucinous neoplasm (not shown) and additionally high grade cytology (enlarged nuclei with loss of polarity, prominent nucleoli, and increased mitotic figures). This case was submitted as possible low grade appendiceal mucinous neoplasm, diagnosed by the consultant as high grade appendiceal mucinous neoplasm, and classified as a major disagreement-upgraded. d In contrast, a low grade appendiceal mucinous neoplasm is defined by low-grade cytology with retained nuclear polarity in mucinous epithelium. e, f The significance of extra-appendiceal mucin was a common source of consultation. In contrast to mucin displaced by artifact of gross contamination, for example, real mucin featured tissue infiltration and a tissue reaction. On low power, the extra-appendiceal mucin infiltrated around lymphoid aggregates (e, arrows) and contained small capillaries (e, f, asterisks) and dystrophic calcifications (f, bracket)

Goblet cell carcinoid-related lesions were the second most common consult

After low grade appendiceal mucinous neoplasm-related consults (49% of total consults, n = 21), the next most common consultation theme was the classification of goblet cell carcinoid-related lesions (19% of total consults, n = 8). The submitted diagnoses included the following: goblet cell carcinoid (n = 2), adenocarcinoma (n = 3), adenocarcinoid (n = 1), no submitted diagnosis [n = 2; both diagnosed by the consultants as goblet cell carcinoid with adenocarcinoma (adenocarcinoma ex-goblet cell carcinoid)]. Of the cases with submitted diagnoses (n = 6), most were in agreement with the submitted diagnosis (n = 5). The one disagreement was a case submitted as “adenocarcinoma”, but diagnosed by the consultant as “goblet cell carcinoid”. Although goblet cell carcinoid and adenocarcinoma are staged and treated similarly, the change to histologic classification was regarded as significant and, consequently, scored as major disagreement-downgrade. Fig. 5.

Fig. 5
figure 5

After low grade appendiceal mucinous neoplasm-related consults, the second most common consult theme related to the classification of goblet cell neoplasms. a, b Under the Tang classification, this lesion was classified “typical goblet cell carcinoid” (group A) based on intact appendiceal architecture on low power (a), and well-defined goblet cell clusters lacking significant atypia and desmoplasia on high power (b) [28]. Under the Yozu classification, this same lesion was classified “low grade goblet cell adenocarcinoma” (grade 1) based on >75% clustered growth pattern. c-d Under the Tang classification, this lesion was classified “adenocarcinoma ex-goblet cell carcinoid, signet ring cell type” (group B) based on destruction of the appendiceal wall on low power (c), and discohesive, single file infiltrative growth pattern, signet ring cells, and significant cytologic atypia seen on high power. This case lacked confluent growth. d Under the Yozu classification, this same lesion was classified as “high-grade goblet cell adenocarcinoma” (grade 3) based on the <50% clustered growth pattern

Discussion

Major consultation themes of this series included tumor and serrated polyp classification, criteria for invasion, the significance of extra-appendiceal mucin, and staging, particularly as these diagnostic issues related to low grade appendiceal mucinous neoplasms. Fig. 2.

Regarding tumor classification, a common diagnostic issue was distinguishing a low grade appendiceal mucinous neoplasm from a serrated polyp. This distinction was particularly challenging in low grade appendiceal mucinous neoplasms with a normal caliber appendix not yet entirely overrun by the low grade appendiceal mucinous neoplasm, or, low grade appendiceal mucinous neoplasms with a villiform surface. As reported by others, gross features favoring a diagnosis of low grade appendiceal mucinous neoplasm included a distended appendix with abundant luminal mucin and a thinned bowel wall [7,8,9,10,11], whereas, appendices involved by serrated polyps were usually of normal caliber thickness. The typical histologic features of a low grade appendiceal mucinous neoplasm seen at low power included an attenuated appendiceal wall, lamina propria obliteration, a pushing border, and mucin dissection of the fibrotic wall [7,8,9,10,11,12,13,14,15,16]. On intermediate power, low grade appendiceal mucinous neoplasms showed circumferential involvement of the appendiceal lumen by undulating or flat mucinous epithelium with low grade dysplasia, similar to the hyperchromasia and nuclear pseudostratification seen in conventional tubular adenomas [7,8,9,10,11,12,13,14,15,16]. In contrast, serrated polyps usually arose in appendices with normal bowel wall thickness, showed retention of the lamina propria, lacked a fibrotic wall, and displayed serrated epithelium with ovoid nuclei, eosinophilic cytoplasm, and readily identifiable goblet cells. Fig. 3.

The distinction between a low grade appendiceal mucinous neoplasm and adenocarcinoma represented a major consultation theme in this series. Overall, we identified that appendiceal diverticular disease and low grade appendiceal mucinous neoplasm’s broad, pushing border often raised concerns for invasion. Features that the consultants relied on for definitive invasion included tumor budding, small-angulated glands, individual cell infiltration, desmoplasia, small nests of neoplastic glands suspended in mucin, and complex glandular growth [7, 14, 17]. More recently a “small cellular mucin pool” pattern has been described as a feature of infiltration whereby small mucin pools containing neoplastic glands replace the appendiceal wall [11]. Other helpful features of invasion include the presence of signet ring cells, although one should be aware that epithelium suspended in mucin can simulate signet ring cells due to detachment and degeneration artifacts [18,19,20,21,22,23]. Helpful features of true signet rings include identification of the signet rings in viable tissue with tissue infiltration and a desmoplastic reaction [18]. Davison et al suggest that the signet ring component reach a minimum threshold criteria of >10% of the neoplasm for diagnosis [17], although this threshold is not required by the current American Joint Committee on Cancer (AJCC), 8th edition [24]. Worrisome gross features for invasion include complex gross configurations with readily identifiable necrosis, rupture, and hemorrhage (Fig. 3).

The concept of “high grade appendiceal mucinous neoplasm” was a diagnostic issue in one consult submitted as low grade appendiceal mucinous neoplasm. The diagnostic term “high grade appendiceal mucinous neoplasm” refers to appendiceal cases with the typical architecture of low grade appendiceal mucinous neoplasm, but with high-grade atypia as defined by loss of nuclear polarity, glandular complexity, increased nuclear: cytoplasmic ratio, prominent nucleoli, and/or increased mitotic figures [14] (Fig. 4). Importantly, the current AJCC, 8th edition recommends staging high grade appendiceal mucinous neoplasms as invasive adenocarcinoma due to a higher risk of recurrence, worse prognosis, and higher association with extra-appendiceal mucin with neoplastic cells, relative to low grade appendiceal mucinous neoplasms, in the small series available [7, 9, 16, 24, 25].

While there are no minimum criteria for extent of high grade atypia required to qualify for a diagnosis of high grade appendiceal mucinous neoplasm, one unequivocal focus, even if small, is sufficient for diagnosis per the Peritoneal Surface Oncology Group International [14]. By these standards, focal high grade atypia can be easy to miss in an entirely submitted, large appendix that commonly generates upwards of 40 glass slides. For practical purposes, the concept of high grade appendiceal mucinous neoplasm and when and how to report are controversial. In our centers, the finding of a single microscopic focus of high grade or questionable high grade component is reported as “predominantly low grade appendiceal mucinous neoplasm with focal areas of increased proliferation” [11, 17]. The corresponding note documents the percentage of involvement of the high grade component and the lesion is staged as a low grade appendiceal mucinous neoplasm, assuming the high grade component comprises < 10% of the overall lesion. This is based on Davison et al.’s finding of no significant difference in survival between patients with low grade mucinous neoplasms and those with predominantly low grade tumors and <10% of questionable microscopic foci of destructive invasion or nuclear atypia beyond that expected for low grade [17].

The classification of extra-appendiceal mucin in low grade appendiceal mucinous neoplasms was a major consultation theme. In the AJCC, 8th edition, a portion of the appendiceal staging is now based on the presence of mucin alone (without neoplastic cells) [24]. Preceding these changes, the terms “low grade appendiceal mucinous neoplasms with low-risk of recurrence” referred to low grade appendiceal mucinous neoplasms with extra-appendiceal mucin without neoplastic cells, and “low grade appendiceal mucinous neoplasms with high-risk of recurrence” referred to low grade appendiceal mucinous neoplasms with extra-appendiceal mucin with neoplastic cells [10]. The new staging system underscores the importance of carefully evaluating the entirely submitted appendix and the importance of accurately distinguishing extra-appendiceal mucin from an artifact. Owing to the abundance of tenacious mucin within these lesions, it is not uncommon for the mucin on the external surface of the appendix to represent an artifact and originate from contamination via gross manipulation. Helpful tips arguing against an artifact include the identification of mucin dissecting into tissue and the presence of a tissue reaction, such as small capillary and granulation tissue in-growth, mesothelial hyperplasia, and dystrophic calcifications [11, 15] (Fig. 4).

Appendiceal nomenclature has evolved quickly in the past few years and was an issue addressed in this series. The submitted diagnoses of appendiceal “hyperplastic polyps” and “sessile serrated adenomas/polyps” were corrected to “serrated polyps” in accordance with the Peritoneal Surface Oncology Group International consensus statement [14]. This nomenclature change reflects that appendiceal serrated polyps are genetically disparate from their colonic counterparts, despite their identical morphology: appendiceal serrated polyps are characterized by KRAS mutations and colonic sessile serrated adenomas/polyps are more often characterized by BRAF mutations [26]. The outdated term “cystadenoma” (and “mucinous cystic neoplasm”) was also corrected by the consultants to “low-grade appendiceal mucinous neoplasm” based on the Peritoneal Surface Oncology Group International consensus statement [14]. This shift in nomenclature clarifies the uncertain nature of this neoplasm and acknowledges its potential to cause concurrent or subsequent peritoneal disease, unlike a typical “adenoma” which is cured by simple excision.

After low grade appendiceal mucinous neoplasm-related consults, the classification of goblet cell neoplasms was the second most common consultation theme. Features contributing to these consults likely relate to their relatively uncommon incidence and evolving nomenclature and diagnostic criteria. In 1990, Burke et al proposed defining goblet cell carcinoid as goblet cell lesions with < 25% carcinomatous growth, and mixed carcinoid-adenocarcinoma as goblet cell lesions with > 50% carcinomatous growth [27]. In 2008, Tang et al. proposed an alternative classification: Typical goblet cell carcinoid (group A) referred to goblet cell lesions with well-defined goblet cell clusters lacking significant atypia, architectural distortion of the appendix, and desmoplasia; Adenocarcinoma ex goblet cell carcinoid, signet ring cell type (group B), referred to goblet cell lesions with goblet or signet ring cells, single cell infiltration, significant atypia, desmoplasia, destruction of the appendiceal wall, and lacking confluent growth; Adenocarcinoma ex-goblet cell carcinoid, poorly differentiated carcinoma (group C), referred to goblet cell lesions with at least focal goblet cells and >1 low power or 1 mm2 indistinguishable from a poorly-differentiated adenocarcinoma, which could appear as gland forming, confluent sheets, or undifferentiated carcinoma [28].

While Tang’s updated system offered more granular histologic grading criteria, it sometimes has practical challenges in application. The lack of a minimum criterion for a signet ring component can be problematic. Classifying lesions with some, but not all, of the diagnostic criteria for a particular diagnostic category, and reliably discerning the distinction between goblet cells versus signet ring cells, and desmoplasia versus fibrosis present unique challenges. To address some of these challenges, in 2015, Lee et al proposed a simplified two-tier histologic grading system of low- versus high-grade goblet cell carcinoid based on cytologic atypia, stromal desmoplasia, and solid growth pattern [29]. More recently, in 2018 Yozu et al. proposed classifying all goblet cell carcinoid-related lesions as goblet cell adenocarcinoma with histologic grading of low, intermediate, and high grade based on the percentage of tubular or clustered growth [30]. In support of this classification, independent investigators demonstrated that goblet cell carcinoid and goblet cell carcinoid with adenocarcinoma (adenocarcinoma ex-goblet cell carcinoid) share similar mutational backgrounds and likely represent a morphologic spectrum of the same histologic entity, which is distinct from the usual neuroendocrine tumors and adenocarcinomas [31,32,33]. For practical purposes, providing both the Yozu classification to acknowledge the most recent molecular advances, along with the Tang classification, may serve as a prudent approach until this evolving issue is clarified (Fig. 5). Regardless of grading system employed, goblet cell carcinoid continues to be staged as adenocarcinoma in the AJCC, 8th edition [24].

Of note, Valasek et al recently reported their experience of 46 appendiceal referral-based consults collected between 2014–2015 [34]. Consults were identified through a computer-assisted search of appendiceal cases with primary mucinous neoplasm evaluated by outside pathologists, and then through the in-house slide review at the University of California San Diego over a 2-year period during patient referral for oncologic care. They similarly reported appendiceal knowledge gaps as the distinction between low grade appendiceal mucinous neoplasm and adenocarcinoma, recognition of high grade appendiceal mucinous neoplasm, outdated terminology, and incomplete reporting elements, such as documenting extra-appendiceal neoplastic epithelium. An overall disagreement rate of 28.3% was reported, and all disagreements were attributed to over-interpretation by the originating pathologist. Both our study and Valasak et al’s identified higher disagreement rates than those previously reported in the range of 0.6–10% [1,2,3,4,5,6]. A possible explanation is that our studies focused exclusively on gastrointestinal-related cases, a field identified as the highest in disagreements according to the largest referral-consult study to date [1]. We suspect our overall 53% disagreement rate was higher than Valasak et al.’s 28.3% disagreement rate based on the nature of the consults. None of the consults from the Valasak et al study were specifically requested by the originating pathologist (extramural based consults), whereas, our study was exclusively extramural based. Extramural consults are expected to have higher rates of disagreement because they are initiated by the diagnostic uncertainty of the originating pathologist.

The strengths of our study were also its weaknesses. The study incorporates more than 1300 extramural consults because multiple pathologists at multiple centers were involved. This allowed for an expansive study of common diagnostic themes across the country. However, it was not feasible for all nine pathologists at all seven sites to review all 1365 consults prospectively due to logistical constraints. By studying exclusively extramural consults, as opposed to referral-based consults, this study was enriched for diagnostic issues. However, since all cases were extramural consults with no readily accessible medical records for review by the consultants, it was not possible to record all clinical history, management, and outcomes, and, consequently, the designation of “major” versus “minor” discrepancies based on management distinctions was imperfect as it was theoretical and at the judgment of the authors. While the consultants were all gastrointestinalspecialized pathologists from major academic medical centers, the authors do not assume that the consultants were unilaterally correct in all cases, since many of the grey-zone areas were complicated by sometimes divergent criteria, subjective interpretation, and limited clinical context. Last, despite the broad study design, the 43 appendiceal consults collected represented a limited study size compared to all gastrointestinal, liver, and pancreatobiliary consults. Had the study been larger, we anticipate we may have encountered other challenging themes, such as endometriosis simulating low grade appendiceal mucinous neoplasm and appendiceal adenomas of the conventional type [35].

Despite these weaknesses, this study represented the largest study of gastrointestinal focused extramural consults, encompassing the broadest geographic territory of academic medical centers. We report a 53% disagreement rate and identified common knowledge gaps in the appendix. These identified gaps can be leveraged to direct future academic studies, consensus guideline development, and improve educational opportunities.