Introduction

Although vaccines have been available for almost 2 decades, chronic hepatitis-B virus (HBV) infection remains a major health problem worldwide. Current chronic hepatitis B is treated with either nucleotide analog or interferon. Interferon therapy successfully controls infection in about one-third of the chronically infected individuals (Delaney et al. 2001). Treatment with interferon-alpha often leads to the cessation of viral replication in 30–40% of the patients with chronic hepatitis B (Heintges et al. 2001). In addition, the 6–12 months of therapy is not only expensive, but also includes many side effects that can be debilitating in some patients (Liaw 2002; Wai and Lok 2002; Feld and Locarnini 2002). These tailorings of the therapies to individual patient problems have led to the search for predictors of the drug response rate to treatment. Current candidate predicting factors include viral genotypes, the ALT level, serum HBV DNA, female gender, fibrosis on liver biopsy and the serum fibronectin level (Kao 2002; Sakai et al. 2002; Kao et al. 2000a, b; Neudorf-Grauss et al. 2000; Helvaci et al. 1999).

With the advent of pharmacogenetics, and considering the host's unique genetic background, the role of DNA polymorphisms (including STRP and SNP) in association with treatment response has become increasingly appreciated and supported in a variety of illnesses. Hence, looking into such a topic may lead to important predictions of treatment response for interferon therapy, with its many displeasing side effects, in HBV patients. In particular in hepatitis-B disease, MHC I and MHC II class polymorphisms, TNF-α, mannose-binding protein, as well as eIF-2α and MxA in the interferon pathway, the SNPs have been suggested to affect the host immune and antiviral response and thus are associated with variable disease progression and treatment responses for previous studies (Hohler et al. 1997; Thursz et al. 1995; McNicholl et al. 2000; Scully et al. 1990; King et al. 2002).

Genetically structured populations may be composed of two or more subpopulations with distinct drug-reaction profiles and thus may be better considered separately in some contexts. Sometimes the population substructure is not obvious, and as a result, a sample may consist of heterogeneous subsamples from the population. This raises the question of the appropriate way to find the genetic background difference between distinct drug-reaction groups and of how to relate the inferred genetic structure to drug response.

Our study focuses on researching the differences of host genetic backgrounds between interferon responders and non-responders by highly polymorphic STR markers. We found 13 STR markers with different allele frequencies between the responder and non-responder groups. Principal component (PC) analysis and hierarchical clustering analysis were performed to find the genetically different patient subgroups with differing responsiveness by STR markers. It could provide a method for analyzing host genetic factors of distinct drug-reaction groups and a different view of the candidate gene SNP approach.

Materials and methods

Study subjects

We retrospectively enrolled 104 Chinese Han patients with chronic hepatitis B from our outpatient clinics at the National Taiwan University Hospital and Taipei Municipal Jen-Ai Hospital. Informed consent was signed and collected from each patient. All patients' blood samples were HbsAg (+) and HBeAg (+) with an elevated ALT that was at least twofold higher than the upper limits of normal for 6 months. HBV genotypes were determined using PCR-restriction fragment-length polymorphisms of the surface gene of HBV as in previous reports (Kao et al. 2000a, b; Lindh et al. 1999). Patients were excluded from receiving interferon therapy if they had any of the following criteria: neutrophil count <1,500 cells/mm3, Hgb <12 g/dl in women or 13 g/dl in men, or a platelet count of <90,000 cells/mm3, a history of poorly controlled thyroid disease, and a serum creatinine level >1.5 times the upper limit of normal at the initial screening. Live biopsy was performed in some patients to exclude cases with severe cirrhosis. Eligible patients received interferon-alpha (2a or 2b) at a dosage of 5–10 MU three times per week for 4–6 months, and were subsequently followed by a series of standard tests to assess the treatment response for more than 1 year. The definition of sustained responders to IFN treatment for chronic hepatitis B disease included the loss of HBeAg at 1 year after completing the treatment. Patients with concurrent hepatitis C or D infection were also excluded from the study. Our study protocol conforms to the ethical guidelines of the 1975 Declaration of Helsinki as reflected by the approval of our (National Taiwan University Hospital) institutional review committee.

DNA preparation

Genomic DNA was extracted from 99 unrelated Taiwanese chronic hepatitis patients. DNA was isolated from blood samples using QIAamp DNA Blood kit (QIAGEN, Valencia, CA) according to the manufacture’s instructions. The quality of the isolated genomic DNA was checked using agarose gel electrophoresis analysis, and the quantity was determined by spectrophotometer and stored at −80°C until use.

Microsatellite genotyping

Genotyping was performed using the ABI PRISM Linkage Mapping Sets MD-10 (400 markers; Applied Biosystems: ABI) and provided coverage of the human genome at 10-cM average resolution. Each marker set included a fluorescence-labeled forward primer and a tailing reverse primer. The PCR amplifications were carried out according to the manufacturer’s instructions. PCR products were separated on ABI 3700 DNA analyzers. The use of the GeneScan 500 LIZ as the internal size standard assists in polymorphic fragment length calling and allows more accurate allele calling and unambiguous comparison of data across experimental conditions. Genotypes were initially scored using Genescan and Genotyper (ABI) software and were then verified independently by three individuals without prior knowledge of the phenotype.

Statistical analysis

The associations of STR markers with the interferon treatment response were analyzed by Monte Carlo simulation (Sham and Curtis 1995). Various alleles of the significant markers were tested for linkage disequilibrium by the analysis of the contingency table. The significant alleles were tested for risk factors by odds ratio. The genotype contingency tables are constructed according to the specific allele with the significant P value and odds ratio. The meaningful genotypes are generated by testing with χ2 and significant odds ratio to the genotype contingency table. All significant marker genotype information was collected and the dataset transformed into a binary category; the meaningful genotype and the other genotype were represented with “1” and “0,” respectively. PC analysis and hierarchical clustering were performed on the transformed binary dataset. Major parts of the statistical analyses were performed by SAS 8.0.

Results

Patient characteristics

Sera were collected from 99 chronic hepatitis patients. Forty-six patients were responsive and 53 patients non-responsive to IFN α treatment. Demographic and clinical characteristics of IFN α-treated chronic hepatitis B patients are shown in Table 1. The sums of patients were not consistent in some comparing of the clinical outcome due to several patients with unavailable characteristics.

Table 1 Demographic and clinical characteristics of patients with response or non-response to IFN treatment

Markers correlated with the efficacy of interferon therapy

Two hundred twenty-one STR markers in chromosomes 1, 2, 3, 4, 5, 6, 7, 8, 9, 17, 21 and 22 were genotyped. The other chromosomes were not genotyped due to the subjects’ DNA shortages. After the data were analyzed, 13 STR markers were sought by allele and/or genotype frequency with a significant difference between the responder and non-responder groups (Table 2). For example, the D1S2890 226bp allele, defined as the E allele, was more frequent in the non-responder group and showed a significant odds ratio. Homozygousity of the D1S2890 E allele and heterozygousity of the D1S413 A allele showed significantly different frequencies between the responder and non-responder groups. The alleles of each locus were tested with the odds ratios and 2×2 χ2 test (data not shown), otherwise with alleles of the same locus. In Table 2, the column “Associated allele test” lists the alleles that were statistically significant in the test. Some loci, for example, D1S2890, have more than one associated allele. The genotype for each associated allele was classified as homozygous and heterozygous then tested with the odds ratio and 2×2 χ2 test (data not shown), otherwise with genotypes of the same locus. In Table 2, the column “Meaningful genotype test” lists the results of the test. The genotype of locus D1S2890 was shown to be a recessive model. Other loci, except D6S292, were shown to be dominant models. The marker D6S292 could not define a specific allele or genotype with different frequencies between the two patient groups and therefore was discarded from further analysis. Along with D6S292, D2S367 was also discarded because the Monte Carlo estimation did not reach a significant level. D17S785 was another marker not reaching a significant level with the Monte Carlo estimation, but its C-allele and genotype contained with the C-allele showed good differentiation between the two patient groups. We selected 11 of 13 STR markers, discarding D2S367 and D6S292, for further analysis.

Table 2 Testing results of the allele and/or genotype frequency of 13 STR markers that were significantly different between the responder and non-responder groups

Principal component analysis

The significant genotype datasets were transformed into the binary category; the meaningful genotype and the others genotype were represented with “1” and “0,” respectively. In loci with more than one meaningful genotype, a higher frequency than one was selected.

The principal component analysis is a technique that analyzes the information of uncorrelated composite variables without any significant loss from the original data sets. The scores resulting from the PC can also be used as input variables for further analysis of the data using multivariate techniques such as cluster analysis, regression analysis and discriminate analysis. The advantages of using PC scores is that the new variables are not correlated, and the problem of multicollinearity is avoided (Subhash 1996). With these characteristics, we use this method to analyze STR data that may exist multicollinearly.

Some common criteria were used to define the number of PCs. First, the eigenvalue of the components should be larger than one. Second, fewer components and larger culminate variance are better. SAS and R (the R Project for Statistical Computing, http://www.r-project.org) were used to perform PC analysis of significant STR markers. In Table 3, we chose five components that explain 62% of the total variability amount of these markers. Table 3 shows the weights of all markers for the first five components. The first component always has the largest proportion. In our case, this was 18%. D8S260 and D9S288 provide more information than the others in component 1, and so does D5S406 in component 2, D7S515 in component 3 and D4S391 in component 4. The positive and negative weights indicate the different directions of their information.

Table 3 Principal component score of the first five components

Figure 1a shows the pair-wise scatter plot of the five components. The non-responders are shown in the red triangle, and the black circle represents the responders. There are some patterns between component 1 and the others. These scatter plots correspond with the two groups in drug response, although several subjects are misgrouped. Other components do not have the same pattern as component 1, but they still provide some information. Figure 1b shows the scatter plot of the first three components. The two groups represented by the red triangle and black circle can be divided into obvious different parts. The ambiguous region is at the cross part between the two distinguishing parts in this plot. The PC scores were used to perform hierarchical clustering.

Fig. 1
figure 1

Principal component scores. a Two-dimensional scatter plot of each two of the first five component scores. b Three-dimensional scatter plot of the first three component scores. Open triangle: non-responder; open circle: responder

Hierarchical clustering analysis

Cluster analysis is a set of methods that can divide samples into several subgroups. These methods were often used to observe the nature of the features between/within the divided subgroups. If samplings are from several different groups, the members of the same group usually share some common attributes (or character). The attributes of subjects are similar in the same group and different between groups. Most of the cluster analysis methods divide subjects by their character similarities. It often considers the distances between two subjects as a measurement of similarity.

Obviously, some subdivisions were found in our data, which were performed by several distance matrixes and Ward’s cluster methods using SAS software. In Figs. 2, 3 these subjects could be separated into six groups unless the data were transformed by the PC score. Although the number of groups is the same, the subjects in each group are not exactly the same. The PC score is uncorrelated. It may imply some potential relations existed among markers when clustering subjects by these markers.

Fig. 2
figure 2

Hierarchical clustering using original binary dataset. Every group has their common character. R/NR is the count of responders and non-responders belonging to the same group

For the purpose of finding homogeneous responder groups, clustering using the PC score should be better at grouping than using the original binary data. The R/NR ratio of Fig. 2 shows that group 1 has a higher R/NR ratio (10/4). In Fig. 3, groups 3, 4 and 5 show higher R/NR ratios: 11/4, 7/2 and 11/2, respectively.

Discussion

Variation in allelic frequency among subpopulations, or clinal variation, is often thought to result from differential selection in various parts of the population in combination with population substructuring. Such spatial variation in allelic frequency may be temporally stable, but changing with time. Limited gene flow between the subpopulation gives a stable polymorphism that gradually disappears as the amount of gene flow increases (Hedrick 2000).

Fig. 3
figure 3

Hierarchical clustering using PC score transformed dataset. Every group has its common character. R/NR is the count of the responders and non-responders belonging to the same group

The patients’ inclusion criteria may act as a selection power for picking a specific genotype pool. All patients have the same phenotype (HBV carrier) except for the efficacy of interferon treatment. Variation in allelic frequency between responders and non-responders may present clinal variation and/or linkage disequilibrium with polymorphism of the causative genes. Theoretically, STR markers have more power to detect LD than diallelic markers such as SNP (Ott and Rabinowitz 1997; Chapman and Wijsman 1998). The sufficiently dense sets of microsatellite data can be used to make initial predictions about the level of short-range LD present in susceptibility regions identified by linkage study (Schulze et al. 2002).

Because of their underlying complexity, constructing predictive models of interferon-treated sustained response requires (1) more samples to perform guaranteed robust solutions for hierarchical clustering, (2) more genotyping to find more suitable markers, and (3) including environmental deviation or other non-genetic affection factors (i.e., gender, age, viral type, viral titer, etc.) for the analysis. Our pilot study suggests HB infection patients could be divided into genetic subgroups with variable sustained responses. The discoveries yield a powerful knowledge base to develop tools for predicting and changing the course of HB patient care and treatment.