Introduction

Hepatitis B is a major infectious disease in Asia. Patients with chronic hepatitis B carry high liver cirrhosis and liver cancer risks1,2. Before the development of antiviral therapies, there was no efficient strategy for preventing chronic hepatitis B from progressing to end-stage liver disease3,4. According to reports from the World Health Organization, 2 billion people are estimated to have experienced a hepatitis B virus (HBV) infection during their lifetime and more than 350 million are chronically infected. In addition, approximately 600,000 people die annually from acute or chronic HBV infection sequelae.

Liver fibrosis develops in patients with chronic hepatitis B because of repeated necroinflammation, followed by hepatocyte regeneration and the accumulation of extracellular matrix (ECM) proteins, such as collagen5,6. When hepatocytes are injured, the immune system is activated and growth factors are released that activate hepatic stellate cells to produce collagen and other fibrillar components7. These substances are deposited into the ECM space. During liver injury, the collagen degradation process is impaired, which further leads to excessive ECM protein accumulation8. The end stage of liver fibrosis is liver cirrhosis, which can lead to liver cancer or liver failure.

Liver biopsy examinations are considered the “gold standard” for liver fibrosis diagnosis, by which clinical strategies for patient management are drafted. Biopsy samples provide direct information regarding the degrees of liver injury and fibrosis. Descriptive or semi-quantitative scoring systems are used for staging the disease extent based on liver histology findings. The traditional method classifies liver biopsy samples as absent, mild, moderate, or severe fibrosis. The first semi-quantitative scoring system, which was introduced in the 1980s, comprised a number range to indicate different categories of pathological features classified based on severity9. Several routinely used scoring systems were subsequently developed, including the Knodell histological activity index (HAI)10, Scheuer11, ISHAK12 and METAVIR13 systems. However, the pathological features that must be assessed using these systems do not have strict objective definitions. Thus, grading and staging scores are to some extent based on the subjective opinions of pathologists. Inter- and intra-observer variations can be as high as 35% and obtaining highly reproducible results using these systems is difficult14,15,16.

Several studies have quantified liver fibrosis by employing image analysis systems17,18,19. These computer-aided systems are used to provide objective quantitative measurements that might exclude observer discrepancies. However, the staining-dependent processes used in these systems cause difficulties due to variations in the staining procedure between biopsy samples. Moreover, most of these systems define the liver fibrosis staging index only according to the total fibrosis area in the samples20,21,22. Other pathological features also play vital roles in liver fibrosis grading and staging23, necessitating computer-based systems that can provide measurements in addition to the total fibrosis area. Matalka et al.23 developed the first quantification system based on a pattern classification method; however, the features extracted from the images did not provide histopathological definitions. The system developed by Dioguardi24 permitted more than 10 quantitative measurements; however, these measurements were incompatible with those used by pathologists and poor correlations were found between the obtained measurements and commonly used scoring systems. Goodman et al.25 summarized the common histopathologic features of liver biopsies that can be considered in staging systems (primarily morphological features). However, incorporating all of these particular morphological features into a pathologist-readable system for clinical use is challenging.

Recently, a staining-free approach that utilizes second harmonic generation (SHG) and two-photon excitation fluorescence (TPEF) microscopy has been developed, allowing for a more quantitative method to assess fibrosis26,27. By eliminating the use of staining, this method reduces the variations commonly observed in the staining process. Furthermore, the use of SHG provides a true quantitative measurement of the collagen level in the absence of sample digestion, leading to preservation of the morphological information in images26,27. Collagen quantitation using SHG/TPEF microscopy has been evaluated in animal models but not in a large collection of human tissues with various stages of fibrosis26,27. Such studies should be completed before this technique can be applied to patient samples.

Using both monophasic and biphasic models to interpret a biological system is not uncommon; for example, biphasic waveforms are commonly observed in electrocardiograms28,29. Many automated external defibrillators are designed to incorporate this biphasic nature for defibrillation with different electric waves at different times30. In this study, we used a similar biphasic approach to create differing models for the early and late stages of fibrosis development. We measured and quantified nineteen morphological features of collagen using SHG and TPEF microscopy and developed two algorithms for assessing fibrosis using these nineteen parameters for both early and later fibrosis stages. The results were validated with the histopathological ISHAK fibrosis assessment and conventional collagen staining methods, followed by transmitted light microscopy imaging. The validation results showed that the multivariate generalized linear model was better for assessing the fibrosis progression pattern at late liver fibrosis stages, whereas a non-linear (support vector machine, SVM) approach was better for the assessment of early fibrosis stages.

Results

Advanced SHG/TPEF microscopic quantification of fibrillar collagen was highly correlated with ISHAK fibrosis scores

To elucidate the SHG/TPEF microscopy imaging capacity for liver fibrosis, 175 liver biopsy samples were obtained from chronic HBV-infected patients. The clinical parameters of the recruited patients are listed in Table 1. One hundred and five randomly selected samples were assigned to the training cohort and used for developing and optimizing the computer-based algorithm for liver fibrosis quantification. The other 70 samples were used as a validation set to assess the performances of the models.

Table 1 Basic clinical data for the patients whose liver tissues were submitted for fibrosis evaluation.

Figure 1A–D shows the images acquired using the conventional staining (Fig. 1A,B) and SHG/TPEF microscopy (Fig. 1C,D) methods. In SHG/TPEF imaging, fibrillar collagen was detected by SHG and is shown in green, whereas hepatocyte morphology was captured by TPEF microscopy and is shown in red. Visually, the SHG/TPEF technology resulted in the generation of a much more enhanced image with a better collagen distribution contrast. To minimize interference with normal-structured parenchyma and portal tract collagen, normal collagen readings were excluded. Figure 1E depicts the correlations identified by basic quantification (total collagen) or advanced quantification (Fibro-C-Index)27, excluding the normal-structured collagen signals31, with the ISHAK scores. The advanced quantification results were highly correlated with the ISHAK fibrosis scores.

Figure 1
figure 1

Comparison between SHG/TPEF imaging and linear microscopy after Masson’s trichrome staining.

(A) and (B) represent fibrotic liver sections stained with Masson’s trichrome. Collagen fibers are stained in blue and hepatocytes are stained in red. (C) and (D) are SHG/TPEF microscopy images. Fibrillar collagen (green) was detected by an SHG signal and hepatocyte morphology (red) was observed with TPEF. (E) Correlations between the ISHAK scores and SHG/TPEF microscopy quantification results of samples from the training cohort. The total collagen levels are represented by solid squares and the advanced collagen readings are represented by solid circles.

Morphological liver fibrosis features visualized by SHG/TPEF microscopy

With the SHG/TPEF system, liver fibrosis severity could be quantitatively measured by examining the collagen fiber amount and morphology. Figure 2 shows the collagen profiles obtained from liver fibrosis qualitative imaging of the liver biopsies, including the total collagen distribution (A, green), various collagen strings (B, colored region), collagen percentage of the total area (CPA) (C) and collagen crosslinks (D, white arrow). The SHG/TPEF system employed 19 features (Table 2) to quantitatively measure liver fibrosis severity. These features were classified into three categories. The first feature category (F1-3) was collagen percentage, which included the total, aggregated and distributed collagen percentages. The second feature category (F4-13) was based on collagen string detection. Once a collagen string was detected, the contour that enclosed the collagen pixels was defined. Subsequently, the convex hull and the area and perimeter of each collagen string contour were also measured (Fig. 3A). To further describe the shapes and properties of the detected collagen strings, for each string, an ellipse was applied that enclosed all of the pixels used to calculate the global parameters (e.g., breadth and length) (Fig. 3B). Collagen connectivity (Fig. 3C) within the strings was calculated by skeletonizing them and then detecting the intersections between them.

Table 2 Univariate linear regression analysis of the 19 SHG morphological features (F1 to F19) and ISHAK fibrosis scores.
Figure 2
figure 2

Morphological features used for qualitative liver fibrosis imaging.

The total liver collagen ((A), green), different collagen strings ((B), colored region), collagen percentage of the total area (C) and crosslinked collagen ((D), white arrow) are shown.

Figure 3
figure 3

Primary morphological structural properties.

Details of the morphological features, including the area, contour perimeter, convex hull, length, breadth and collagen crosslinks, are shown in (A–C).

The last set of features (F14-19) was generated according to the ratios of the different collagen string types.

Correlation between the morphological features observed with SHG/TPEF microscopy and ISHAK fibrosis stages

To identify the critical morphological features that can be used to assess liver fibrosis, the correlations between the 19 morphological features and ISHAK fibrosis scores were analyzed using univariate linear regression. Assuming that the ISHAK fibrosis scores were a linear function that increased from 1 to 6, we calculated the P values of all 19 features. The results are shown in Table 2. Note that most of the first and second category features (11 of 13 features) were highly correlated (P < 0.001), whereas only some of the third category features were correlated (P < 0.05).

Biphasic algorithm development for liver fibrosis assessment

In statistical analysis, collagen percentages (F1-3) were first employed to perform model fitting according to the ISHAK fibrosis scores (scores 1–6). Second, the collagen string properties (F4-13) were added. Finally, the collagen string ratios (F14-19) were included. A receiver operating characteristic (ROC) curve was used to assess the performance of the SHG/TPEF fibrosis score, using samples from the verification cohort (Table 3).

Table 3 Performances of the SHG/TPEF fibrosis scoring models (AUCs) (generalized linear model vs. SVM) using different combinations of morphological features.

Overall, the sophisticated non-linear SVM model was superior to the generalized linear model. In both models, the additional morphological features significantly increased the area under the ROC curve (AUC). Additionally, some features were more important for identifying different stages, e.g., the F4-13 string properties in the SVM model were used to identify early-stage progression and the F14-19 string ratios in the generalized linear model were used to identify late-stage progression.

Most combinations possessed high AUC values (most were higher than 0.7, but some were higher than 0.8), indicating that these features were crucial indices that were highly compatible with the clinically accepted ISHAK scores (i.e., these SHG-detected morphological features might facilitate efficient liver fibrosis assessments).

Among all of the models shown in Table 3, the largest AUC was measured while distinguishing between the non-cirrhotic (ISHAK stages 1–4) and cirrhotic (ISHAK 5–6) patients with the multivariate generalized linear model, which utilized all 19 included features (AUC = 0.829). Additionally, the generalized linear model was better at distinguishing early from advanced cirrhosis patients (12345 vs. 6; AUC = 0.826). However, the SVM model was optimal for classifying the early fibrotic stages (AUCs = 0.745, 0.774 and 0.797 for 1 vs. 2–6, 1–2 vs. 3–6 and 1–3 vs. 4–6, respectively). By utilizing the strengths of both model types, a biphasic strategy was proposed, the first step of which was to classify patients as either cirrhotic or non-cirrhotic using the generalized linear model with all 19 variables. If cirrhosis was detected, the same model was used for the score calculation. Otherwise, the SVM model was used to assess patients with early stage fibrosis.

Discussion

Liver biopsy remains the standard for monitoring fibrosis progression, in which a small piece of liver tissue is removed using a biopsy needle, stained, examined under a microscope and graded based on a descriptive or semi-quantitative scoring method32. In addition to the potential complications related to the biopsy procedure, inherent drawbacks exist, including potentials for sampling errors, staining variations and inter- and intra-observer variabilities, in the interpretation of histology results32,33.

With the development of both mode-locked lasers and highly sensitive optical sensors, non-linear optical microscopy, such as multi-photon excitation fluorescence and multi-harmonic generation, has become an affordable option for tissue imaging34,35. Because TPEF microscopy has a different excitation mechanism than SHG, TPEF signals can be easily separated from SHG signals using the appropriate detectors. Combined TPEF/SHG images reveal additional fibrotic liver data, such as precise collagen distributions and undistorted liver cell morphologies. In recent studies, TPEF microscopy has been widely used for imaging organ structure components and their dynamic interactions in biological tissues. Additionally, SHG microscopy has been increasingly used for measuring highly ordered structures without central symmetry, such as type I collagen, which is the dominant collagen type in fibrotic livers36,37,38.

Compared with conventional collagen imaging, which uses transmitted light microscopy of histological tissue sections that are stained with either Masson’s trichrome or picrosirius red, SHG microscopy has several advantages, as follows: (1) this method provides superior information regarding fibrillar collagen; (2) sample staining is unnecessary; thus, the staining variations that result from different stain batches, protocols, time-dependent fading and photobleaching26,39 are eliminated; (3) fluorophores are absent from tissues; thus, signals are unaffected by dye concentrations and photobleaching; (4) infrared excitation sources can be used, resulting in less scattering in the tissue; and finally, (5) SHG microscopy enables deeper tissue penetration for imaging purposes; hence, the 3D visualization of fiber architecture can be achieved40. Additionally, TPEF microscopy can provide complementary structural and cellular information41.

Fibrillar collagen formation occurs late in fibrogenesis. A traditional fibrosis microscopy evaluation is commonly based on restricted assessments related to later fibrotic changes, whereas features of the early stages of fibrogenesis are usually neglected. However, the use of new morphological criteria in conjunction with standard histopathological fibrosis scoring could improve the assessments of relatively small amounts of fibrillar collagen within fibrous tissues. This study evaluated 19 morphological features of SHG/TPEF images that are automatically included by the Genesis program. Morphological features such as collagen string formation and connectivity may allow for the identification of several stages in the progression of fibrillar collagen formation; therefore, these morphological features may enable the evaluation of early stages of fibrogenesis. Gailhouste et al.42 have highlighted the benefits of the linear scale provided by the SHG index by comparing it with the histological METAVIR scoring system. However, they have shown that the continuous scale provides greater accuracy than the semi-quantitative scoring system because it underestimates variables that demonstrate architectural differences in collagen, resulting in the restricted SHG indexing of fibrosis stages.

In this study, a biphasic model was developed to separately assess the early and late stages of fibrosis. We compared a multivariate generalized linear model with the SVM model. The results showed that the SVM approach was superior for detecting and differentiating between fibrosis stages (ISHAK 1–4), suggesting that collagen accumulation during fibrosis progression might not be a linear process. The finding that the biphasic approach displayed better accuracy than the monophasic approach further suggests that the pace of collagen accumulation differs during the early and late stages of fibrosis development. After incorporating the SVM into the biphasic approach, we successfully achieved an ROC of >0.82 for monitoring late liver fibrosis stages (cirrhosis) and an ROC of >0.75 for monitoring early liver fibrosis stages. We also observed that the addition of many parameters with relatively high P values (P > 0.001) resulted in better performance for assessing the fibrosis degree, especially for the non-linear models. For example, F14-19, which includes the different collagen string feature ratios, was not directly correlated with the degree of fibrosis; however, the addition of these parameters to the multivariate generalized linear model resulted in significant improvements in the ROC values in most cases.

AUC analysis revealed that the inclusion of some features, such as the string properties (F4-13), into the SVM models helped to improve the distinction of early stage fibrosis stage, whereas including the string ratios (F14-19) in the generalized linear models helped to improve late fibrosis stage distinction. The first part of these results is in agreement with the results of a previous study of early stage collagen remodeling26, which revealed alterations in collagen shapes but no substantial change in the total amount of collagen. Moreover, the second part of our findings suggests that although the total amount of collagen is altered during late liver fibrosis stages, the ratio of the different collagen string features is a better method for assessing fibrosis stages.

In this study, the main reason for liver biopsy of chronic hepatitis B patients was pre-treatment evaluation. Therefore, these patients were mostly in the immune clearance stage and were not evenly distributed over all clinical stages. As a result, many patients were positive for HBeAg but were already progressing to liver cirrhosis, suggesting a failure of HBeAg seroconversion despite repeated hepatitis flares. However, even during the immune clearance phase, patients who were positive for HBeAg still had a borderline lower fibrosis score (mean ± SD, 3.42 ± 1.41 vs. 3.06 ± 1.36; HBeAg negative vs. positive, P = 0.156).

In conclusion, this study developed a fully automated and quantitative assessment system employing SHG/TREF microscopy to specifically assess HBV-related fibrosis. We systematically optimized the parameters for quantitative SHG/TPEF liver tissue imaging and developed fully automated image analysis algorithms. Compared with the traditional staining systems, the present system enables enhanced collagen fiber detection and quantification for fibrosis research and clinical diagnosis. SHG/TPEF microscopy is a standard analytical tool and its use in combination with a computer-based detection and calculation system represents a powerful technique that can be applied in multi-center comparisons of liver fibrosis.

Methods

Tissue Preparation

The study cohort consisted of 175 liver biopsy samples from chronic HBV-infected patients who underwent liver biopsies at LinKo Chang Gung Memorial Hospital between 2000 and 2012. Clinical and pathological characteristics were obtained from the patients’ medical records. As described below, liver fibrosis was staged by pathologists according to the ISHAK scoring system and was then automatically evaluated with computational algorithms according to 19 morphological variables. Of the 175 samples, 105 (training cohort) were used for developing a computer-based algorithm with a close correlation to the ISHAK fibrosis score. The other 70 samples were used to validate the scoring method derived from the training system.

This study was approved by the Ethics Committee of Chang Gung Memorial Hospital. Written informed consent was obtained from each patient and the study was carried out in accordance with the approved guidelines.

Masson’s Trichrome Staining

Masson’s trichrome staining was performed at the Chang Gung Memorial Hospital Department of Anatomic Pathology as follows. First, 5-micron FFPE sections were deparaffinized and hydrated in distilled water. Then, Bouin’s fixative was used as a mordant for 1 h at 56 °C. The FFPE sections were cooled and washed in running water until the yellow coloring disappeared. The samples were stained in Weigert’s hematoxylin stain for 10 min, thoroughly washed in tap water for 10 min, stained again in an acid fuchsin solution for 15 min and then rinsed in distilled water for 3 min. After rinsing, the slides were treated with phosphomolybdic acid solution for 10 min and then rinsed in distilled water for 10 min. Finally, the slides were stained with a light-green solution for 2 min and rinsed in distilled water. After thorough dehydration using alcohol, the slides were mounted and coverslips were placed onto them.

Image Acquisition System

Images were acquired using a GenesisTM (HistoIndex Pte. Ltd, Singapore) system, in which SHG microscopy was used to visualize collagen and the other cell structures were visualized with TPEF microscopy. Details of the optical settings used can be obtained from Sun et al.26 and Tai et al.27.

The samples were laser-excited at 780 nm, SHG signals were recorded at 390 nm and TPEF signals were recorded at 550 nm. Images were acquired at 20× magnification with 512 × 512 pixel resolution and each image had a dimension of 200 × 200 μm. Multiple adjacent images were captured to encompass large areas. Nine images (3 × 3 images, 600 × 600 μm dimension) were acquired per site. Overall, five 3 × 3 images from five sites were obtained for each sample.

SHG Quantification

Total collagen percentages and previously described collagen features, including specific collagen strings and collagen connectivity-related measurements, were used to assess correlations with the ISHAK fibrosis scores. Nineteen morphological features were used in this study. These variables were divided into three groups, as follows:

Group 1

CPA (collagen percentage of the total area), Agg (the aggregated collagen percentage of the total area) and Dis (the distributed collagen percentage of the total area). The total collagen level was detected by SHG and segmented from the raw images. The aggregated and distributed collagen were differentiated with a low-pass filter26.

Group 2

NoStr (collagen string number), NoShortStr (short collagen string number), NoLongStr (long collagen string number), NoThinStr (thin collagen string number), NoThickStr (thick collagen string number), StrArea (total string area), StrLength (total string length), StrWidth (total string width), StrPerimeter (total string perimeter) and NoXlink (total crosslink number). The collagen strings were detected using a connected-component algorithm of binary images from a previous image segmentation. Each connected component was fitted with an ellipse and then the axes of the ellipses were used to divide the strings into long and short as well as thin and thick sub-groups. The total lengths, widths, areas and perimeters of the collagen strings were also measured. We used an image skeleton algorithm to detect branch points for determining crosslink numbers.

Group 3

NoShortStr/NoStr (ratio between the short string and total string numbers), NoLongStr/NoStr (ratio between the long string and total string numbers), NoShortStr/NoLongStr (ratio between the short string and long string numbers), NoThinStr/NoStr (ratio between the thin string and total string numbers), NoThickStr/NoStr (ratio between the thick string and total string numbers) and NoThinStr/NoThickStr (ratio between the thin string and thick string numbers). We found that these relative ratios improved the system only when absolute (normalized) values were used for Group 2.

Statistical Analysis and Model Construction

Univariate analyses were performed to assess correlations between individual image features and the ISHAK scores using linear regression. Two modeling method types were used for the patient classifications, which were based on multiple image features. In the first method, the dependent variable, y, logit, which was a fibrosis stage indicator, was a linear combination of n feature variables X1 … Xn, as shown in the following equation:

The coefficients β0, β1, …βn were estimated with the maximum likelihood method.

The second method used was the SVM model. A hyperplane was constructed for optimally classifying the subjects into fibrosis stages based on the feature values in the m-dimensional space (m ≥ n). The radial basis function (RBF) kernel was used as the distance measure between two subject vectors x and x’, i.e.,

where σ controls the width of the kernel.

Additional Information

How to cite this article: Wang, T.-H. et al. Automated biphasic morphological assessment of hepatitis B-related liver fibrosis using second harmonic generation microscopy. Sci. Rep. 5, 12962; doi: 10.1038/srep12962 (2015).