Main

Hepatic graft-versus-host disease occurs in patients post allogeneic stem cell or bone marrow transplant, and rarely in other situations, such as post solid organ transplantation or after blood transfusion. Over 80% of these patients develop liver abnormalities, commonly elevated liver chemistry tests, and over 40% develop hepatic graft-versus-host disease.1, 2 The etiology of the liver abnormalities may be related to the stem cell or bone marrow transplant, such as hepatic graft-versus-host disease or recurrent disease, its associated therapies, such as drug-induced liver injury, infections, and total parenteral nutrition-related injury, or owing to unrelated intrinsic liver disease.2, 3 In particular, drug-induced liver injury is a very challenging diagnosis in this setting and understandably so because these patients are receiving multiple drug regimens, many of which have potential hepatotoxicity. Furthermore, many of these entities in the differential diagnosis have similar clinical and biochemical presentations. As treatments and prognoses are vastly different, diagnosis of hepatic graft-versus-host disease is critical. Several hepatic graft-versus-host disease prediction scores have been developed; the National Institutes of Health Scoring System is common in clinical practice, and incorporates serum values of total bilirubin, alanine aminotransferase, and alkaline phosphatase.4, 5 However, recent studies have shown that this system has sensitivity of only 55% and specificity of 53%.6, 7 Thus, current non- and minimally invasive tests, and the probability scores based upon them, often fail to reliably distinguish hepatic graft-versus-host disease from its mimickers.

The pathologic diagnosis of hepatic graft-versus-host disease is challenging, and most previous studies lacked control groups.8, 9, 10, 11, 12, 13, 14, 15, 16, 17 Few studies with robust blinded methodology and control groups exist, and include Shulman et al.,18 who identified an extensive list of histologic features seen more in hepatic graft-versus-host disease patients compared with a control group, including acidophilic bodies, bile duct loss, intraepithelial lymphocytes, and absence of portal inflammation. Snover et al.19 described endotheliitis as being present more frequently in hepatic graft-versus-host disease compared with controls. Several groups have published consensus guidelines on the histologic features of hepatic graft-versus-host disease, and recommendations for diagnosis and terminology, often in three to four-tiered scoring systems.4, 20, 21 A study with a strong control group, which includes the most common clinical and histologic mimickers of hepatic graft-versus-host disease, such as drug-induced liver injury, is needed. Furthermore, simplification of this challenging diagnosis for pathologists, such as by combining significant histologic features into an algorithmic tool, would be of great utility.

The objective of the current study was, first, to identify any histologic features that may be more specific to hepatic graft-versus-host disease as compared with a control group, which specifically included cholestatic drug-induced liver injury and vanishing bile duct syndrome pattern of drug-induced liver injury. In addition, we sought to determine whether histologic differences exist between cholestatic and hepatitic clinical presentations of hepatic graft-versus-host disease. Finally, our goal was to develop a histologic-based algorithm for diagnosis of hepatic graft-versus-host disease using the identified histologic parameters.

Materials and methods

Forty liver biopsies from adult patients post allogeneic stem cell or bone marrow transplant with clinically confirmed hepatic graft-versus-host disease and absence of other intrinsic liver disease were examined, along with 44 control biopsies. The control etiologies included nine hepatitic drug-induced liver injury, nine steatohepatitis, eight cholestatic drug-induced liver injury, seven normal livers, six cholestatic total parenteral nutrition-related injury, and five vanishing bile duct syndrome pattern of drug-induced liver injury. Two of the cholestatic drug-induced liver injury controls had a cholestatic hepatitis histologic pattern. The drug-induced liver injury control cases had causality assessed by expert hepatologists, with identification of etiologic agent(s), and exclusion of other liver injuries with clinical, serologic, and pathologic testing.

Cases were selected via search of the laboratory information system of the Department of Pathology at The Mount Sinai Hospital using specific search terms such as ‘graft-versus-host disease’, ‘drug-induced liver injury’, ‘total parenteral nutrition’, ‘steatohepatitis’, and ‘vanishing bile duct syndrome’. Potential hepatic graft-versus-host disease cases were then cross-referenced by an exhaustive clinical chart review by an experienced hepatologist (TDS), and the 40 patients were selected based on the integrity of the diagnosis, absence or exclusion of confounders, completeness of information, and extensive clinical, serologic, microbiologic, and radiologic work-up.

All hematoxylin and eosin (H&E)-stained biopsy slides were blindly evaluated by two hepatopathologists (AES and MIF) for a panel of histologic features. Bile ducts were assessed for the presence or absence of apoptoses, cholangiocyte necrosis, cytoplasmic alteration (eg, eosinophilia, vacuolation), nuclear hyperchromasia, and nuclear disarray (crowding, loss of polarity), each expressed as a percent of bile ducts involved. Cholangiocyte apoptoses were defined as apoptotic bodies—condensed cells with karyorrhectic debris and often surrounded by a white space or halo—within the confines of the bile duct basement membrane. Cholangiocyte necrosis was defined as cholangiocyte nuclear loss, resulting in conspicuous gaps between nuclei. In addition, bile duct damage was scored from 0 to 3 (0=none, 1=single cholangiocyte necrosis or apoptosis, 2=confluent cholangiocyte necrosis, 3=bile duct loss), reflecting the average appearance of the portal tracts. Bile duct loss was assessed quantitatively, and expressed as a percent. Twenty-five percent of cases, including both controls and hepatic graft-versus-host disease, had cytokeratin 7 (CK7) performed as part of the original work-up; digitized slides (Scanscope, Aperio) were available on 76% and bile duct loss was assessed quantitatively. Acidophilic bodies were enumerated per 10 consecutive high power fields (HPF), beginning in the area of greatest acidophilic activity. Cholestasis was scored from 0 to 3 (0=none, 1=zone 3 confined, 2=involving zones 2 and 3, and 3=panzonal or bile infarct), based on the most affected area, and location (canalicular, hepatocellular) was noted. Ductular reaction was scored from 0 to 3 (0=none, 1=isolated, 2=extending beyond two hepatocyte plates, 3=extensive), based on the average appearance of the portal tracts. Intraepithelial lymphocytes were scored 0–3 (0=none, 1=1, 2=2–3, 3>3), based on the average number of lymphocytes within bile duct epithelium in the specimen. Furthermore, one author (AES), blinded to diagnosis, used digitized slides (Scanscope, Aperio), which were available on 75% of the cases, to enumerate bile duct cholangiocytes and intraepithelial lymphocytes. A percentage of intraepithelial lymphocytes per cholangiocytes was generated. Portal and lobular inflammation were scored using the Sheuer system.22 Presence and degree of portal eosinophils was recorded. The presence or absence of portal vein endotheliitis and hepatic venulitis were also noted.

Hepatic graft-versus-host disease cases were further stratified by clinical definitions of acute (<100 days post transplant) and chronic (≥100 days post transplant), and into cholestatic and hepatitic forms, where the hepatitic form was defined as hepatic graft-versus-host disease with aspartate aminotransferase and/or alanine aminotransferase at least 10 × the normal value.23

After statistically significant variables were identified, defined as P<0.05, these histologic features were combined into an algorithm, with the goal of creating a simple histologic formula while preserving diagnostic accuracy. After the final algorithm was selected, etiology of control cases was unmasked, such that false positives could be identified.

Statistical analysis was completed using Analyse-It version 3.9 for Excel. Comparison of histologic factors in hepatic graft-versus-host disease versus controls, and in acute versus chronic hepatic graft-versus-host disease, was completed using the Student t-test. Sensitivity, specificity, and accuracy were calculated using standard equations.

Results

Patient and biopsy characteristics of hepatic graft-versus-host disease and control groups are shown in Table 1.

Table 1 Patient and liver biopsy characteristics

Hepatic graft-versus-host disease cases, when compared with the general control group, had statistically greater bile duct damage (P<0.0001), bile duct loss (P=0.0002), cholestasis (P<0.0001), acidophilic bodies (P=0.0006), and statistically less ductular reaction (P<0.0001) (Table 2). Degree of bile duct loss was not significantly different when comparing H&E and CK7 quantification (P=0.17). Canalicular and hepatocellular cholestasis occurred together, and both were more frequent in hepatic graft-versus-host disease compared with controls (P<0.0001, P<0.0001). There was a trend toward more hepatic venulitis in the control population (P=0.0617). Other histologic parameters were not significantly different between the groups, including intraepithelial lymphocytes evaluated by two methods, even when normal controls were excluded (P=0.7854 by score, P=0.9545 by enumeration). There was a statistically significant association between degree of portal inflammation and presence of portal vein endotheliitis (P<0.0001). The spectrum of bile duct damage in hepatic graft-versus-host disease is shown in Figure 1. We found that all studied features of bile duct injury were significantly positively correlated and seen more frequently in hepatic graft-versus-host disease compared with controls (data not shown). References to ‘bile duct damage’ in the results and discussion are based on the bile duct damage score described in the methods. Lobular alterations in hepatic graft-versus-host disease are shown in Figure 2

Table 2 Histologic features of hepatic graft-versus-host disease cases and controls
Figure 1
figure 1

Spectrum of bile duct damage in hepatic graft-versus-host disease. (a) Cholangiocytes showing eosinophilic cytoplasm and variable nuclear hyperchromasia and crowding. An apoptotic body is present (arrow), 40 ×. (b) Cholangiocytes showing eosinophilic, vacuolated cytoplasm, loss of nuclear polarity, and regions of confluent necrosis with nuclear loss (white arrows). Note the adjacent marked cholestasis and bile infarct (black arrows), 20 ×. (c) Bile duct with areas of confluent cholangiocyte necrosis (arrows). There is minimal to no ductular reaction, 20 ×. (d) Portal tract lacking bile ducts, 40 ×. (e) CK7 immunohistochemistry highlighting intermediate hepatobiliary cells in areas of bile duct loss, 10 ×.

Figure 2
figure 2

Lobular changes in hepatic graft-versus-host disease. (a) Hepatocellular and canalicular cholestasis, with hepatocyte feathery degeneration, 40 × (b) An acidophilic body is present (top right), along with damaged bile ducts (bottom left), 40 ×.

When hepatic graft-versus-host disease cases were compared with a drug-induced liver injury-only control group, the same features maintained significance, whereas hepatic venulitis was seen significantly more often in the control group (Table 3).

Table 3 Histologic features of hepatic graft-versus-host disease cases versus drug-induced liver injury controls

When hepatic graft-versus-host disease cases were compared with a cholestatic drug-induced liver injury control group, bile duct loss, bile duct damage, acidophilic bodies/10 HPF, and ductular reaction retained significance, but cholestasis became non-significant; however, hepatic venulitis was significantly higher in the control group (Table 3).

When hepatic graft-versus-host disease cases were compared with a vanishing bile duct syndrome pattern of drug-induced liver injury control group, absence of ductular reaction was more often seen in hepatic graft-versus-host disease cases, and there was less bile duct loss (Table 3). A trend toward more hepatic venulitis in control cases was noted.

No histologic differences were found in comparing the acute (<100 days post transplant) to chronic (≥100 days post transplant) hepatic graft-versus-host disease groups.

Twenty-eight percent of the hepatic graft-versus-host disease cases met the definition of hepatitic subtype, with a mean alanine aminotransferase of 602 U/l (range 318–1134) and aspartate aminotransferase of 376 U/l (range 110–813), compared with the cholestatic subgroup, which had means of 139 U/l (range 46–270) and 85 U/l (range 17–253), respectively. Despite this hepatitic clinical presentation, there was no significant difference in any of the histologic features examined between the hepatitic and cholestatic hepatic graft-versus-host disease groups (Table 4).

Table 4 Histologic and biochemical features of hepatitic versus cholestatic hepatic graft-versus-host disease cases

Using an abridged scoring system (Table 5) based on the significant histologic findings, we created a simple histologic algorithm for the diagnosis of hepatic graft-versus-host disease with a three-tiered scoring system (Figure 3). Many iterations of the algorithm were evaluated, each using the significant histologic features revealed by data analysis. Weight was placed on bile duct damage and loss. As a simpler surrogate for all features of bile duct damage, we elected to use our bile duct damage score, based on cholangiocyte apoptoses and necrosis and bile duct loss. This score significantly positively correlated with other studied features of bile duct damage, and all cases which had other features of bile duct damage also had apoptoses and/or necrosis. We suspect that assessment apoptoses and necrosis may be less subject to inter-observer variability than other factors; further study is needed. With the goal of making the algorithm as user-friendly as possible, the analysis of other histologic features was streamlined, requiring ‘yes/no’ responses. The final proposed algorithm is simple, but provides a high accuracy in diagnosis of hepatic graft-versus-host disease and in distinction from the control groups.

Table 5 Hepatic graft-versus-host disease probability score
Figure 3
figure 3

Novel histologic algorithm to predict likelihood of hepatic graft-versus-host disease.

When applied to the hepatic graft-versus-host disease and control groups, this algorithm had a sensitivity of 93%, specificity of 93%, accuracy of 92%, negative predictive value of 93%, and positive predictive value of 90%. The most common cause of false positive cases was vanishing bile duct syndrome pattern of drug-induced liver injury. There were four false positives (three vanishing bile duct syndrome, one total parenteral nutrition) and three false negatives. Twenty-five percent of cases stratified to the ‘probable’ category.

Discussion

Hepatic graft-versus-host disease is a challenging clinical and pathologic diagnosis. In this study, we have identified histologic features that are significantly different in hepatic graft-versus-host disease compared with controls: greater bile duct loss, bile duct damage, cholestasis, acidophilic bodies, and less ductular reaction. This study is unique in that the control group includes the main clinical and pathologic differential diagnoses of hepatic graft-versus-host disease, including cholestatic total parenteral nutrition-related injury, cholestatic drug-induced liver injury, and vanishing bile duct syndrome pattern of drug-induced liver injury. In contrast to many consensus documents and studies, our findings did not show that hepatic graft-versus-host disease is enriched for intraepithelial lymphocytes nor portal inflammation; however, it does confirm the robustness of bile duct damage, bile duct loss, cholestasis, and acidophilic bodies in this diagnosis.9, 10, 15, 16, 18, 21 Differences may be owing to controls. In addition, our results emphasize importance of the absence of ductular reaction and hepatic venulitis in hepatic graft-versus-host disease.

Using our data, we have developed a highly accurate and easy-to-use histologic algorithm, the first of its kind for hepatic graft-versus-host disease. This algorithm may be superior to previously published scoring systems, which predict probability of hepatic graft-versus-host disease; however, further studies comparing these systems on the same study group are needed.6, 7 The algorithm is weighted, with bile duct damage and duct loss accounting for a large number of points; these features appear most important overall in separating hepatic graft-versus-host disease from all controls. Additional support points may be gained from other categories, with hepatic venulitis and absence of ductular reaction particularly helpful in comparison with drug-induced liver injury controls. Further studies, including prospective testing of this algorithm, and analysis of inter-observer variability in its application, are underway.

The current study emphasizes the importance of liver biopsy in this clinical scenario. Although there are risks associated with liver biopsy, these are cases in which clinical presentation, liver chemistry tests, and current scoring systems are unable to reliably discern the diagnosis of hepatic graft-versus-host disease, making histologic exam necessary. Although biopsy is more invasive than liver chemistry tests used in such scoring systems, the high accuracy of the diagnostic histologic algorithm should be weighed against the risks of biopsy. In addition, other investigators have demonstrated the usefulness of histologic features, including lobular inflammation and hepatocyte ballooning degeneration, on outcome in patients with confirmed hepatic graft-versus-host disease.13

Drug-induced liver injury is one of the most common clinical differential diagnoses for hepatic graft-versus-host disease. To our knowledge, this is the first study including drug-induced liver injury, and subdivided drug-induced liver injury patterns, within the control group. Our results show that, compared with drug-induced liver injury-only controls, the significant histologic features, namely bile duct loss, bile duct damage, cholestasis, acidophilic bodies, and absence of ductular reaction, are maintained. In addition, we report that hepatic venulitis is a feature in support of drug-induced liver injury, rather than hepatic graft-versus-host disease, and may be used to help differentiate the two. We hypothesize that hepatic venulitis in drug-induced liver injury cases may be related to drug-induced endothelial injury. The presence of hepatic, rather than portal, venulitis could be related to the release of drug metabolites into the sinusoids and hepatic venous drainage, resulting in elevated local concentrations, and/or higher susceptibility to injury owing to the relatively low oxygen environment. The scenario of concurrent hepatic graft-versus-host disease and drug-induced liver injury is possible, but not present in our study population and, thus, not addressed here.

When hepatic graft-versus-host disease cases were compared with a cholestatic drug-induced liver injury control subgroup, presence of cholestasis no longer was significant. However, hepatic graft-versus-host disease cases still showed significantly greater bile duct loss, bile duct damage, and acidophilic bodies, with significantly less ductular reaction and hepatic venulitis. These features can further assist the pathologist in this difficult differential diagnosis.

The most challenging differential is distinguishing hepatic graft-versus-host disease from vanishing bile duct syndrome pattern of drug-induced liver injury, and indeed it was the most common cause of false-positive results in our algorithm. When we compared our hepatic graft-versus-host disease cases to a vanishing bile duct syndrome pattern of drug-induced liver injury control subgroup, we found that hepatic graft-versus-host disease cases had significantly less bile duct loss and less ductular reaction. There was also a trend toward more hepatic venulitis in the vanishing bile duct syndrome pattern of drug-induced liver injury cohort. Thus, in this scenario, the presence of ductular reaction is crucial. It is our own personal observation that vanishing bile duct syndrome pattern of drug-induced liver injury biopsies tend to show a mixture of completely uninvolved portal tracts with affected ones, compared with hepatic graft-versus-host disease which shows diffuse involvement of portal tracts. Further study with a larger cohort of vanishing bile duct syndrome cases is warranted.

Hepatitic hepatic graft-versus-host disease, which has been defined by elevations of alanine and/or aspartate aminotransferase >10 × normal, represents 29–55% of hepatic graft-versus-host disease.23, 24, 25 Our data show that despite significantly elevated hepatitic liver chemistry tests, the histology of such cases was identical to those with a cholestatic presentation. Although none of our cases had the histology of severe acute hepatitis, reports of hepatic graft-versus-host disease resembling acute hepatitis, with lobular disarray and even bridging necrosis, exist.26, 27, 28, 29 It is unclear if these cases truly represent hepatic graft-versus-host disease, another process, or concurrent diagnoses; further study, and exclusion of other causes, such as drug-induced liver injury and viral infection, including Hepatitis E Virus, is needed.30 Hepatic graft-versus-host disease may also be associated with de novo liver diseases. Beleoken et al.31 noted several hepatic graft-versus-host disease patients who went on to develop an autoimmune hepatitis presentation after tapering or cessation of immunosuppression, and suggest that hepatic graft-versus-host disease may be a factor in the pathogenesis of de novo autoimmune liver disease. A primary biliary cholangitis-like picture has been similarly noted in murine models.32, 33, 34 For such cases, an immune-reconstitution syndrome should also be considered.

The strengths of the current study include the large cohort of clinically confirmed hepatic graft-versus-host disease patients with available liver biopsy tissue and control group, which includes the main differential considerations in this clinical and pathologic setting. This is the first study, to our knowledge, to incorporate drug-induced liver injury as part of the control group and have a subgroup analysis, and the first to compare histologic features of hepatic graft-versus-host disease with clinical hepatitic versus cholestatic presentation. The algorithm proposed demonstrates diagnostic reliability and may be superior to other available tools.

In practice, assessment may be limited by the adequacy of the liver biopsy. This may be affected by the type of biopsy; for example, transjugular liver biopsies, which most of this patient population may receive, may be of smaller gauge and contain less portal tracts when compared with a percutaneous sample. Bile duct loss may be challenging to recognize on H&E-stained sections. Ancillary studies, such as CK7 immunohistochemistry, may be useful. CK7 will highlight preserved bile ducts and ductular reaction. In addition, when a duct is truly lost, periportal hepatocytes become metaplastic, referred to as ‘intermediate hepatobiliary cells’, and show expression of cytoplasmic CK7. A rhodanine stain may highlight periportal copper accumulation associated with cholate stasis of chronic duct loss.

As part of prospective interrogation of the algorithm, one author (AES) has received two biopsies from patients post stem cell transplantation, and evaluated the pathology, including application of the algorithm, prior to reviewing any further history or clinical impression. The first biopsy showed diffuse bile duct damage, with areas of confluent cholangiocyte necrosis, 20% bile duct loss, scattered acidophilic bodies, mild cholestasis, and absence of ductular reaction and endotheliitis. It categorized into unequivocal hepatic graft-versus-host disease (score 5) per the algorithm. The patient was a 51-year-old male with history of chronic myelogenous leukemia and diffuse large B-cell lymphoma who underwent stem cell transplantation one month prior to biopsy. He was not able to tolerate the full regimen of immunosuppression after transplantation, and had received numerous antibiotics and antifungals in the prior month. Biopsy was performed for elevated liver transaminase activity levels and bilirubin, with a clinical differential of drug-induced liver injury, infection, and hepatic graft-versus-host disease. After the liver biopsy, the patient was placed on additional immunosuppression and fluconazole was discontinued. Liver enzymes and function improved in the following days, but did not normalize. The patient died of multiorgan failure 5 days after liver biopsy. Owing to the short survival post biopsy, and concurrent cessation of fluconazole and increase in immunosuppression, it is difficult to draw conclusions in this case. The second biopsy showed diffuse bile duct damage, 25% bile duct loss, marked cholestasis with bile infarcts, presence of acidophilic bodies, and absence of endotheliitis and ductular reaction. Per the algorithm, it corresponded to unequivocal hepatic graft-versus-host disease (score 6). The patient was a 53-year-old man who underwent stem cell transplantation for refractory follicular lymphoma four years prior, had a history of intestinal graft-versus-host disease treated two years prior, and had recent transformation to diffuse large B-cell lymphoma, at which time immunosuppressants were stopped. He developed new bilirubin and transaminase activity level elevation, and the clinical differential included hepatic graft-versus-host disease, infection (the patient had a history of Cytomegalovirus and shingles), and involvement by lymphoma. A skin biopsy was done at the same time as the liver biopsy, and later reported as compatible with graft-versus-host disease. After the liver biopsy, the patient was started on cyclosporine and prednisone, and liver function and transaminase activity levels improved. The patient died of lymphoma seven months later. In this case, the course, response to treatment, and lack of confounding treatment alterations support the diagnosis of hepatic graft-versus-host disease and accuracy of the algorithm. Prospective analysis of additional cases is required.

In conclusion, we have combined several specific significant histologic parameters into a simple, accurate hepatic graft-versus-host disease probability score and three-tiered scoring system, allowing for incorporation into clinical practice. This algorithm is able to accurately distinguish between mimickers and hepatic graft-versus-host disease, whether having a hepatitic or cholestatic presentation. In comparison with currently used scoring systems and consensus documents, which guide clinical decisions, our algorithm appears to provide improved sensitivity, specificity, and accuracy.