Main

Specialties such as gynecology, urology and gastroenterology are facing difficult dilemmas when trying to incorporate ‘preneoplastic lesions’ into management recommendations that may vary from observation only to more radical surgical approaches.1, 2, 3 Interobserver agreement studies have been introduced as a way to enhance consensus on terminology and classification of lesions.4, 5, 6, 7, 8 The study of pathogenesis and disease outcome would be facilitated by classification system that is devoid of subjective interpretation and could uniformly be applied worldwide, thus enabling clinicians and scientists to share and compare information.

Intrahepatic and extrahepatic cholangiocarcinoma develop through a multistep carcinogenesis, and are preceded by premalignant or in situ neoplastic lesions.9, 10 At least two major precursor lesions have been associated with the development of invasive cholangiocarcinoma. The first is a microscopic lesion of flat or low-papillary dysplastic epithelium, and known by names, such as biliary dysplasia, atypical biliary epithelium or carcinoma in situ.9 The second is an intraductal papillary neoplasm of the bile duct (biliary IPN), which is a macroscopic lesion characterized by prominent papillary proliferation of dysplastic epithelium with frequent intestinal metaplasia and mucin hypersecretion.11, 12, 13, 14 Biliary IPN is now regarded as the biliary counterpart of pancreatic intraductal papillary mucinous neoplasm.15, 16, 17 These two lesions are different in their pathological and clinical characteristics.11, 17 However, both of them may occur in the bile duct of patients suffering from primary sclerosing cholangitis (PSC), choledochal cyst or hepatolithiasis, all of which are known to predispose for the development of cholangiocarcinoma.11, 12, 13, 15

In 2005, we conducted an interobserver and intraobserver agreement study on the diagnosis of biliary dysplasia of the bile ducts affected by hepatolithiasis, and proposed a tentative set diagnostic criteria.18 We used the term ‘biliary intraepithelial neoplasia’ (BilIN) instead of biliary dysplasia, according to the World Health Organization classification of tumors.19 BilIN was classified into three categories (grades) based on the degree of cellular and structural atypia: BilIN-1, BilIN-2 and BilIN-3. These diagnostic criteria have been subject of criticism. For example, the proposal of BilIN in this previous study was based solely on the intrahepatic biliary lesions of patients with hepatolithiasis. It remained unclear if those diagnostic criteria would also apply to premalignant lesions of the extrahepatic bile ducts or those occurring in other diseases such as PSC and choledochal cyst. Another criticism was that all pathologists enrolled in the previous study were from Asian countries (Japan and Korea).18

We now report on the results of an international interobserver agreement study of the histological diagnosis of biliary dysplastic lesions conducted among 17 pathologists from the United States and Europe, in addition to Asia. This study did not include cases of biliary IPN. The aim of this study was to establish international histological criteria of BilIN, which could be applied to biliary non-invasive neoplastic lesions in the intrahepatic and extrahepatic bile ducts of various biliary diseases.

Materials and methods

Case Selection

Histological specimens were selected from the resected or explanted material of patients with PSC (n=6), choledochal cyst (n=7) and hepatolithiasis (n=10). The male to female ratios and average ages were as follows: PSC, 4:2 and 58.2 years (36–67 years); choledochal cyst, 3:4 and 25.9 years (18–35 years); hepatolithiasis, 3:7 and 53.7 years (range: 46–68 years). None of the cases was associated with an invasive cholangiocarcinoma. From these specimens, we selected a total of 30 lesions ranging from reactive to dysplastic biliary epithelium (10 lesions from each disease). The lesions from the PSC and hepatolithiasis patients were from large intrahepatic or hilar bile ducts, whereas those from choledochal cyst were of extrahepatic bile duct. No gallbladder lesions were included in this study. The lesions were not macroscopically detectable and did not histologically correspond to biliary IPN. Two digital images (original magnification; × 100 and × 400) of each lesion were taken at 5 600 000 pixels using a microscopic digital camera (DP50, Olympus Corporation, Tokyo, Japan). They were randomly arranged in a single file and numbered from case 1–30.

Terminology of Biliary Epithelial Lesions

The term ‘biliary dysplastic lesion’ has been commonly used to describe non-papillary premalignant lesions or in situ neoplastic lesions of the bile duct. In this study, we used the term ‘BilIN’, because this terminology was used in the World Health Organization classification of tumors, in a similar fashion as for premalignant lesions of other organs such as pancreas (pancreatic intraepithelial neoplasia (PanIN)) and prostate (prostatic intraepithelial neoplasia (PIN)).20, 21 In the biliary system, three grading systems have been commonly used. Some groups classify these lesions as mild, moderate or severe dysplasia. Other groups use low- and high-grade dysplasia and carcinoma in situ. In this study, non-invasive dysplastic lesions of biliary epithelium were classified into three histological grades; BilIN-1, BilIN-2 and BilIN-3. BilIN-3 includes so-called ‘carcinoma in situ’. The terminology and classification are in line with the current concept suggesting that the spectrum of these lesions reflects a multistep carcinogenic sequence with increasing neoplastic potential.

Study Design

The file containing the digital images of the 30 lesions was sent to a panel of 17 pathologists with expertise in hepatobiliary pathology. Only the digital file was available for diagnostic purposes. The clinical data and underlying conditions were not disclosed. The study participants were asked to categorize each lesion as reactive change, BilIN-1, BilIN-2 or BilIN-3, applying the tentative criteria reported previously.18 Briefly, BilIN-1 showed mild cellular/nuclear atypia such as nuclear membrane irregularity or nuclear enlargement with only a minimal disturbance of cellular polarity. BilIN-2 had evident cellular/nuclear atypia, but not enough to suggest overt carcinoma, with a focal disturbance of cellular polarity. BilIN-3 showed a diffuse disturbance of cellular polarity with or without distinct cellular/nuclear atypia corresponding to an overt carcinoma. Next, the participants were asked whether they could determine the underlying disease (hepatolithiasis, PSC or choledochal cyst) based on the nature of the epithelial changes of BilIN alone. Finally, they were asked to provide suggestions and modifications of the diagnostic criteria. Two of the authors (YN and YZ) performed the data analysis. The initial diagnostic criteria for BilIN were modified based on the suggestions. We subsequently obtain a consensus on the new classification criteria for BilIN.

Pathologists Enrolled

Seventeen pathologists participated in this study as participants and are all co-author of this article. Yasuni Nakanuma, Yoh Zen and Yutaka Atomi, who selected the specimens and analyzed the data, did not participate in the diagnostic evaluation. No pathologists except one (MS) participated in the previous interobserver agreement study.18 Two pathologists (DvL and AS) jointly diagnosed each lesion as a single participant. Therefore, the interobserver agreement study consisted of 17 pathologists and 16 participants. The pathologists enrolled in this study specialize in digestive system, particularly the hepatobiliary and pancreatic system.

Data Analysis

Interobserver agreement was evaluated according to the concordant rate (%) and the κ-value. Interpretations for the κ-value were as described previously,22 that is, <0.00, poor agreement; 0.00–0.20, slight agreement; 0.21–0.40, fair agreement; 0.41–0.60, moderate agreement; 0.61–0.80, substantial agreement; and 0.81–1.00, almost perfect agreement.

Results of the Interobserver Agreement Study

Among the 30 lesions assessed, the highest level of agreement was reached with respect to the diagnosis: reactive changes in six lesions, BilIN-1 in seven lesions, BilIN-2 in 13 lesions and BilIN-3 in 4 lesions. More than 9 of 16 participants agreed with the diagnosis of 21 of 30 lesions. Twelve of those lesions are shown in Figures 1, 2, 3 and 4. Similarly, a consensus of more than 11 participants was obtained for 12 lesions. Especially, almost perfect consensus was reached in two lesions in which 15 out of 16 participants diagnosed BilIN-3. The concordant rate and κ-value for the diagnosis of 30 lesions were 60.3 and 0.45%, respectively. κ-value for each lesion was 0.42 for reactive change, 0.40 for BilIN-1, 0.16 for BilIN-2 and 0.44 for BilIN-3 (Table 1). Then these participants were categorized into three groups based on their countries: the United States (five participants; NVA, LF, GYL, DvL KO and AS), Europe (six participants; KB, RC, PH, GK, AQ and FS) and Asia (five participants; HH, SMH, KN, MS and WT). There was no suggestion that were related to regional differences with respect to the diagnosis. As shown in Table 1, no significant difference was either observed in κ-values for the diagnosis of biliary lesions among three groups.

Figure 1
figure 1

Histological pictures of biliary epithelia diagnosed as reactive changes by more than nine participants (hematoxylin and eosin; upper, × 100; lower, × 400).

Figure 2
figure 2

Histological pictures of biliary lesions diagnosed as BilIN-1 by more than nine participants (hematoxylin and eosin; upper, × 100; lower, × 400).

Figure 3
figure 3

Histological pictures of biliary lesions diagnosed as BilIN-2 by more than nine participants (hematoxylin and eosin; upper, × 100; lower, × 400).

Figure 4
figure 4

Histological pictures of biliary lesions diagnosed as BilIN-3 by more than nine participants (hematoxylin and eosin; upper, × 100; lower, × 400).

Table 1 Kappa values of interobserver agreement on the diagnosis of reactive and dysplastic biliary epithelial lesions occurring in primary sclerosing cholangitis, choledochal cyst and hepatolithiasis

None of the participants could identify the preceding condition solely based on epithelial changes of BilIN, although seven participants suggested background disease from histological changes of the bile duct wall, such as lamellar fibrosis, inflammation and stromal edema in some cases. Interestingly, one participant suggested that BilIN could be clustered into six groups based on the patterns of epithelial changes themselves: (i) pseudopapillary eosinophilic epithelium with tufting and atypia (eight lesions; Figure 4a and c); (ii) long thin papilla (one lesion); (iii) short thick papilla (three lesions; Figures 3a and 4b); (iv) adenoma-like with glandular involvement (four lesions; Figure 3c); (v) micropapillary with/without intestinal metaplasia (eight lesions; Figures 2a, c and 3b); (vi) others (six lesions including reactive change). This participant did not describe the relationship between those histological groups and preceding conditions. However, it was found that seven out of eight lesions of (i) pseudopapillary eosinophilic epithelium with tufting and atypia arose in hepatolithiasis (three cases) and choledochal cyst (four cases). Three lesions of (iii) short thick papilla were all hepatolithiasis cases. (iv) Adenoma-like with glandular involvement mainly consisted of PSC cases (three out of four cases). Similarly, most lesions of (v) micropapillary with/without intestinal metaplasia derived from PSC (four cases) and choledochal cyst (three cases). That is, BilIN occurring in hepatolithiasis frequently showed (i) pseudopapillary eosinophilic epithelium with tufting and atypia or (iii) short thick papilla. BilIN of PSC commonly manifested (iv) adenoma-like with glandular involvement or (v) micropapillary with/without intestinal metaplasia. BilIN of choledochal cyst often showed (i) pseudopapillary eosinophilic epithelium with tufting and atypia or (v) micropapillary with/without intestinal metaplasia.

Proposal of Histological Criteria for BilIN

All participants agreed on the basic concept of the diagnostic criteria, including the terminology (BilIN) and their classification (BilIN-1, -2 and -3). In addition, several suggestions and opinions for the modification of diagnostic criteria were obtained. In the tentative criteria, micropapillary and non-papillary lesions were separately described in each lesion. In the present study, most participants mentioned that each lesion should be defined irrespective of their architectures because papillary lesions are rare in BilIN, and the term ‘papillary’ might cause confusion with the other papillary premalignant lesion (biliary IPN).

The next suggestion was about description of cellular and nuclear atypia. Detailed descriptions about cellular and nuclear atypia might assist many pathologists, particularly general pathologists, to promote a more general use of this classification. It was also suggested that mitosis in BilIN is one of the features indicating a diagnosis of BilIN-2 or BilIN-3. Intraepithelial neutrophils in the reactive epithelium might be a feature to differentiate it from BilIN-1. Taking these suggestions into consideration, we modified the criteria and establish the consensus classification system of BilIN, as shown in Table 2. We defined each lesion irrespective of architecture and for each described the detailed cellular and nuclear changes. We also added descriptions about glandular involvement, mitosis, nuclear location and intraepithelial neutrophils in the appropriate categories.

Table 2 Modified diagnostic criteria for reactive change of the biliary epithelium and biliary intraepithelial neoplasia

Additional opinions were provided by several participants. One participant mentioned that the two-grade system (low-grade BilIN and high-grade BilIN) might be easier to use than the three-grade system. However, we sometimes found dysplastic lesions difficult to classify simply into low or high grade. BilIN might be biologically divided into two categories (indolent or progressive), although an intermediate grade should be left until the morphological-biological correlation is elucidated. Two participants suggested that immunostaining could improve this classification. Molecule(s) related to the cell cycle or carcinogenesis, such as p53, p21, p16 and MIB-1, might be candidates for such use.23, 24, 25 The expressions of most of these molecules is gradually altered during multistep carcinogenesis, and their expression patterns overlap the grades of BilIN. Therefore, the histological criteria should only be based on morphological features at this time. However, the use of immunostaining may be helpful for BilIN classification and grading, and further studies are necessary to resolve this issue.

In practice, separating reactive inflammatory changes from truly neoplastic lesions is occasionally difficult, especially when the patients have been stented before surgery or biopsy. When in doubt, pathologists are advised to use the terminology of ‘indefinite for dysplasia’. However, one should try to avoid this practice as much as possible. In difficult cases, submission and review of additional sections and consultation with an expert pathologist can help solve diagnostic dilemma.

Discussion

In this study, we examined the interobserver agreement on the diagnosis of BilIN occurring as a complication of PSC, choledochal cyst and hepatolithiasis to obtain a consensus about the terminology and diagnostic criteria for those lesions. We used the diagnostic criteria of a previous study, as tentative guidelines.18 All pathologists enrolled in the present effort agreed on the basic concept of these criteria, including the terminology and three-grade classification. After the interobserver agreement study, we modified the criteria based on the suggestions and opinions obtained from the 17 pathologists. The major alteration concerns the description of the histological features in each lesion. There were short descriptions about cellular atypia and polarity in the tentative criteria; therefore, we added more detailed descriptions of the architecture, as well as the nuclear and cellular changes of each lesion. In addition, we deleted the subcategories of micropapillary and non-papillary lesions to avoid confusion about the discrimination between BilIN and biliary IPN.

The present interobserver agreement study revealed moderate agreement (60.3% and κ=0.45) on the diagnosis of biliary lesions based on the common diagnostic criteria. This agreement is similar to that of our previous study, and to those of studies concerning premalignant lesions of other organs.4, 6, 18, 26 The results somewhat encouraging since it was the first time for most participants to use these criteria. Furthermore, biliary neoplasms are not as commonly encountered and diagnosed compared to entities such as Barrett esophagus in the West or gastric dysplasia in the East, and thus the lack of exposure may also affect the results. The routine usage of the proposed criteria could improve diagnostic agreement.

Interestingly, the interobserver agreement of BilIN-2 was lower than those of reactive change, BilIN-1 and BilIN-3. This resembles the results obtained from the interobserver agreement study on PanIN.26 Hruban et al26 reported that the κ-values for the diagnosis of PanIN-1, PanIN-2 and PanIN-3 were 0.43, 0.14 and 0.42, respectively. Intermediate groups of three-grade systems might be characterized by low levels of agreement. If we use the two-grade system (low or high grade) in the BilIN classification, this issue might be resolved. However, we often encounter dysplastic lesions that are difficult to determine as low grade or high grade. We thought that the three-grade system seems appropriate at this time, although admittedly the clinical significance with regard to management, if a diagnosis was to be made on biopsy, was not evaluated in this study.

We used cases of PSC, choledochal cyst and hepatolithiasis in this study. All participants diagnosed each lesion without difficulties. It is suggested that the identical diagnostic criteria could be used for BilIN irrespective of the preceding biliary diseases. Similarly, intrahepatic and extrahepatic lesions could be diagnosed based on the common criteria. Although none of the participants could determine the background conditions, one suggested that BilIN might be clustered into six groups based on microscopic features of the biliary epithelium. Interestingly, there were some correlations between those groups and the preceding diseases. The histological features of each group might reflect different genetic and epigenetic alterations during cholangiocarcinogenesis in PSC, choledochal cyst and hepatolithiasis. Further analyses employing more cases of BilIN including some from diverse biliary diseases are mandatory to verify this hypothesis.

The histological criteria for BilIN obtained in the present study were based on lesions of intrahepatic and extrahepatic bile ducts. It is well known that similar non-invasive neoplastic lesions can occur in the gallbladder and also in the peribiliary gland. We suggest that the applicability of BilIN classification should be tested for gallbladder and peribiliary-gland lesions, because any consensus criteria have not been established for those lesions.

The relationship between BilIN and PanIN is an interesting question. We speculate that BilIN could be seen as the biliary counterpart of PanIN. In previous studies, we showed that BilIN and PanIN exhibited similar expression patterns of mucin core proteins (MUC1 and MUC2), suggesting that identical phenotypic changes occur in both BilIN and PanIN.11, 27 However, it has not been well documented whether similar genetic alterations also occur in BilIN and PanIN lesions. Thus, further evaluation of oncogenes and tumor-suppressor genes in BilIN and PanIN are necessary to resolve this issue.

Other premalignant lesions, except for BilIN, may occur in the intrahepatic and extrahepatic bile ducts. Biliary IPN is now believed to be the biliary counterpart of pancreatic intraductal papillary mucinous neoplasm.15, 16, 17, 28, 29, 30 The discrimination of BilIN and biliary IPN is usually based on the size of the lesion and the proliferation patterns. BilIN is a microscopic lesion and is not grossly identifiable. BilIN microscopically manifests as flat, pseudopapillary (loss of cellular polarity and pseudostratification) and micropapillary lesions (papillary growth with/without indistinct fibrovascular cores). In contrast, biliary IPN is grossly visible, and microscopically characterized by prominent papillary proliferation with distinct fibrovascular cores. The clinicopathological features are different between BilIN and biliary IPN; therefore, this discrimination seems important.11, 17 However, we rarely encounter premalignant lesions that are as difficult to determine as BilIN vs intraductal spreading of biliary IPN are. It was, therefore, suggested that a consensuses should be obtained in the near future about the discrimination of BilIN and biliary IPN, and the histological grading system of biliary IPN.

In conclusion, this international interobserver agreement study showed that the BilIN system was applicable to premalignant or non-invasive neoplastic lesions of the bile duct identified in the affected bile ducts of PSC, choledochal cyst and hepatolithiasis. We propose a series of revised diagnostic criteria for BilIN. This classification system could serve as ‘working formulation’ and will help the terminology and diagnostic criteria of non-invasive neoplastic lesions of the bile ducts in clinical and research fields.