Introduction

There is increasing support that the bile pigment bilirubin formed during haeme catabolism has antioxidative and cytoprotective properties.1 Most of this evidence comes from in vitro investigations2,3,4,5,6,7,8,9,10,11 and is in line with epidemiological studies describing lower bilirubin concentrations in patients with coronary artery disease (CAD)12,13,14,15,16,17,18 or in offspring with a parental history of heart attack.19 A major gene controlling high bilirubin levels has been implicated by two recent segregation analyses. A study in 1240 adults in 84 Utah pedigrees indicated that a major gene is responsible for elevated bilirubin levels in about 12% (q=0.34) of the subjects. This major gene explained about 27% of the variance in bilirubin levels.13 Similar results were found in the NHLBI Family Heart Study in 555 families of 1292 individuals.14 The most parsimonious model in both studies was a recessive model for high bilirubin levels and a significant polygenic effect.

The present study describes the first genome-wide linkage analysis for loci for high bilirubin levels in 48 large pedigrees with the same ethnicity as investigated in a previous segregation analysis.13

Methods

Subjects

This study is based on 1735 individuals (912 men and 823 women) in 48 pedigrees ascertained because of two or more first-degree relatives with early cardiovascular disease. The minimum and maximum pedigree sizes were 15 and 117 persons, respectively. Serum bilirubin levels and genotypes for linkage analysis were available in 877 of the 1735 Caucasian subjects. Their mean age and body mass index was 39±19 years and 27.0±7.0 kg/m2, respectively. The major proportion of subjects with unknown phenotype and genotypes were individuals in the upper generations of the multigenerational families who were already deceased but were needed to connect the pedigrees together.

Families were ascertained by discharge histories from the hospital20 and by the Utah Health Family Tree Project.21 Early cardiovascular disease was defined as the onset of at least one of the following conditions before the age of 56 years for males or before the age of 66 years for females: myocardial infarction (MI), coronary bypass surgery (CABG), coronary angioplasty (PTCA) and unstable angina pectoris.

Families who met the study criteria were sent a personal health questionnaire which contained questions on demographics, family history, anthropometrics, smoking habits, alcohol consumption, physical exercise, medications as well as hospitalization for cardiovascular-related diseases. Participants visited the University of Utah Cardiovascular Genetics Research Clinic for a blood draw after overnight fast, anthropometric measurements (height, weight, elbow diameter, abdominal and hip measurements and two sitting blood pressure measurements), a physical examination and an interview on cardiovascular history, stroke history, cardiovascular and respiratory symptoms, other medical conditions, illnesses and present therapy. Subjects were excluded from the analysis when they were not fasting, had abnormal liver or kidney function or when they had a history of kidney disease.

Bilirubin measurement

Total serum bilirubin was measured by a thin film adaptation of a diazonium salt, colorimetric method using the Vitros analyser (Ortho-Clinical Diagnostics, Inc. Rochester NY 14650, USA). In this method, dyphylline is used to dissociate unconjugated bilirubin from albumin. Then, unconjugated bilirubin, conjugated bilirubin, and albumin-linked bilirubin (delta) react with the diazonium salt 4-(N-carboxymethylsulfamyl) benzenediazonium hexafluorophosphate to produce azobilirubin chromophores which are read spectrophotometrically at 540 and 460 nM.22

Genotypes

DNA was extracted from frozen buffy coats using standard methods. For the genome search, 583 autosomal and four gonosomal markers with an average spacing of 5.7 cM were genotyped at Myriad Genetics, Inc. Most of the markers are trinucleotide or tetranucleotide repeats and are described in public databases such as Marshfield (http://research.marshfieldclinic.org/genetics), Genethon (http://www.genethon.fr) or Human Genome Database (GDB; http://gdbwww.gdb.org). PCR products were analysed on ABI 377 fluorescent sequencing machines using standard techniques. In-house allele calling programmes and extensive quality control algorithms were used to ensure high quality genotypes. Inheritance of all alleles was verified using the program Pedcheck.23 The genetic map used for all analysis and tables was internally generated from a larger superset of Utah pedigrees by Myriad Genetics, Inc., using the CRI-MAP program24 on 3916 meioses and closely corresponds to the Marshfield map.

Segregation and linkage analysis

Parametric linkage analysis was performed using bilirubin levels as a quantitative trait after adjusting for age, age2, age3, and sex. Fifteen of the 877 subjects who had adjusted bilirubin levels above the mean +5 standard deviations (=38.04 μmol/l) were excluded from the analysis. Recessive model parameters including the estimated gene frequency of q=0.34 and genotype means of 9.4 and 16.9 μmol/l were obtained from a previous segregation analysis.13 We rely on the model parameters of this recent study rather than on those from the present one because the pedigrees in the present study were not expanded by usual sequential sampling rules and persons with CAD or lipid abnormalities were preferentially genotyped. The previous study, however, ascertained pedigree members independently of whether some branches were associated with CAD or dyslipidemia. The disease-driven approach might be advantageous for linkage analysis but is prone to disturb segregation analysis. Nevertheless, we also performed maximum likelihood segregation analysis on the present pedigrees using the Pedigree Analysis Package (PAP)25 and compared the results for the four most important regions with parameter estimates from both datasets.

Linkage analysis was performed using robust multipoint linkage statistics proposed by Göring and Terwilliger26 as implemented in Myriad Genetics' MCLINK software program.27 This programme was developed to perform multipoint analysis on very large pedigrees with any number of markers and is robust to deviations from a normal distribution and to model mis-specifications. It generates multipoint marker haplotypes using a blocked Gibbs sampling approach, checking that the solution converges to the same estimates after multiple runs. Multipoint plots of the best linkage region are produced showing maximum LOD scores calculated at each marker across the chromosome. The haplotype solutions generated by MCLINK27 were also used for nonparametric linkage (NPL) statistics.28 The method is similar to the method developed by Kruglyak et al.29 but is extended to the analysis of quantitative traits. The main differences between these two methods were discussed recently.30 NPL statistics can detect genes with causal mutations that segregate among individuals with either low or high bilirubin values. It not only circumvents the problem of model mis-specifications but also has the ability to recognize the effects of multiple mutations, even if their penetrances, phenotypic effects, and modes of inheritance are quite different. An asymptotic formula was derived to adjust NPL scores, such that equal LOD and NPL scores translated to the same P values.30

Other statistical methods

We compared age and HDL cholesterol concentrations between CAD patients and controls by unpaired t-tests. Wilcoxon rank sum tests were used for bilirubin concentrations. Furthermore, we compared the bilirubin levels adjusted for age, age2, age3 and smoking status by multiple regression analysis. Since the data included in this analysis were based on families and a correlation of the variables within the families was expected, a mixed linear model was used for adjustment to accommodate this intrafamilial correlation by using an identification number for each family as a random effect in the model. Variables predicting the CAD status were identified by stepwise logistic regression analysis.

Results

Linkage analysis

The means±SD bilirubin concentrations were 12.1±5.6 μmol/l in men and 9.7±3.9 μmol/l in women, respectively. Before bilirubin concentrations were used for linkage analysis they were adjusted for age, age2, age3 and sex, since these variables showed significant correlations with bilirubin concentrations (all P values <0.0001 using GLM procedure). The only other covariate in Table 1 which significantly correlated with bilirubin levels was albumin. However, we did not adjust for albumin since this correlation can be explained by the binding protein function of albumin for bilirubin. Table 1 shows the clinical characteristics of the 862 study subjects who had bilirubin levels and genotypes determined and were used for linkage analysis.

Table 1 Clinical characteristics of the 862 study subjects who had bilirubin levels and genotypes determined and who were used for linkage analysis

The results from parametric linkage analysis are based on a recently described recessive model13 considering 587 markers. We repeated the analysis for chromosomes that showed LOD scores above 2.0 after estimating the model parameters from the pedigrees used in this linkage analysis. These pedigrees favoured a codominant rather than a recessive model. The genetic model parameters of the recessive and codominant models are provided in Table 2. MCLINK27 was applied to determine the marker-segregation pattern and the generated haplotype solutions were used to generate model-based multipoint LOD scores and non-parametric NPL scores.28 Forty-four markers showed either a multipoint LOD score or an NPL score greater than 1 (Table 3). Only four chromosomes harbored three or more markers with LOD and/or NPL scores above 1. The results for these chromosomes are presented in Figure 1 and Table 3.

Table 2 Genetic model parameters of bilirubin levels used for linkage analysis
Table 3 Markers with linkage scores above 1 in parametric (recessive model) and nonparametric linkage analysis
Figure 1
figure 1

Suggestive regions for linkage with elevated bilirubin levels. Results represent multipoint LOD scores from analysis with MCLINK using a recessive model. Results from non-parametric linkage analysis (NPL) are shown as a complementary approach. The arrow in the Figure for chromsome 2 indicates the location of the most important candidate gene (UGT1A1: uridine diphosphate glycosyltransferase 1).

The highest multipoint LOD scores and NPL scores were found for markers on the long arm of chromosome 2 (2q34–37). Nine markers in a region of about 60 cM showed LOD scores above one. NPL statistics identified six markers in this region with NPL scores above one. The highest NPL scores were obtained for markers D2S434 (3.29) and D2S1363 (3.22) within an intermarker distance of about 11 cM. The results from parametric analysis revealed a LOD score of 3.01 for marker D2S1363 using the recessive model parameters (Figure 1 and Table 3). When the parametric analysis was repeated using the codominant parameters, a LOD score of 2.51 was obtained.

The second most interesting region was observed on chromosome 18. Depending on the analysis method used, two different regions were identified which are suggestive for linkage. NPL scores identified a region around 50 cM with NPL scores of 2.79 (D18S974) and 2.10 (D18S535) (18q12). Parametric linkage analysis, however, did not provide positive LOD scores for this region. Instead, this method revealed LOD scores around 2.2 for the telomeric region on the short arm of chromosome 18 (18p11) (Figure 1 and Table 3). When this analysis was repeated using the codominant model parameters LOD scores increased to 2.60 (D18S976).

A third region linked to high bilirubin levels was located on the long arm of chromosome 10 (10q25–26) with seven markers with LOD scores mostly between 2 and 2.5 in the terminal 45 cM. These scores decreased when either parameters from the codominant model (D10S1237: LOD=1.11) or NPL scores were analysed (D10S1237: NPL score=1.55).

NPL analysis revealed a 7 cM region on chromosome 9 (9q21) which included four markers with NPL scores between 2.1 and 2.4. LOD scores remained below 1.0 when parametric analysis was performed with either recessive or codominant parameters.

Other markers which showed LOD or NPL scores above 1.0 were single observations which were usually not supported by positive scores of adjacent markers.

Association of bilirubin with CAD

Since this study is based on pedigrees ascertained for two or more first-degree relatives with early CAD, it allowed us to compare bilirubin levels between subjects with and without CAD. We matched 161 male patients with CAD and twice the number of male controls for age and observed significantly lower concentrations in patients (11.3±4.6 vs 12.7±5.3 μmol/l, P=0.005) (Table 4). Logistic regression analysis showed HDL cholesterol (P=0.009) and bilirubin concentrations (P=0.043) to be significantly associated with CAD. Furthermore, we matched five times the number of female controls for age to the 41 female patients with CAD and observed a tendency to lower bilirubin levels in CAD patients which, however, did not reach statistical significance (Table 4).

Table 4 Comparison of patients with coronary artery disease (CAD) and controls matched for age and stratified by gender

Discussion

Evidence for anti-atherogeneity of bilirubin

Previous epidemiological studies described an association between low bilirubin levels and coronary heart disease.12,13,14,15,16,17,18,19 It was found that plasma bilirubin levels correlated inversely with known CAD risk factors, such as smoking, LDL cholesterol, diabetes, and obesity, and correlated directly with HDL cholesterol.19,31 The relation between low bilirubin levels and CAD, however, remained significant after adjustment for known CAD risk factors.16,31 This association was further confirmed in the prospective British Regional Heart Study that observed 737 major ischemic heart disease events in 7685 middle-aged men during 11.5 years of follow-up.15 A small case-control study found decreased values of bilirubin indicative of the presence of atherosclerosis in smokers.32 The Family Heart Study pointed to a gender-specific effect of bilirubin on CAD since it observed an association of borderline significance in males but not in females.14 This is in accordance with our results showing only significant differences for bilirubin levels in males but not in females (Table 4) and with results from the prospective Framingham Offspring Study. The latter suggested higher bilirubin levels to be associated with a lower cardiovascular risk in men, but the pattern was unclear in women.18 Ten-year follow-up mortality data from Belgium observed an inverse association between bilirubin levels and all-cause and cancer mortality but not cardiovascular mortality in men. None of these associations was observed in women.33

The association of bilirubin with atherosclerosis might be explained by in vitro findings showing antioxidative and cytoprotective properties.2,3,4,5,6,7,8,9,10,11 Bilirubin scavenges peroxyl radicals and suppresses the oxidation in liposomes more efficiently than alpha-tocopherol.2 The antioxidative properties may be independent of whether bilirubin is unconjugated, conjugated, free, or albumin-bound.3,4,6 Recent investigations proposed bilirubin to be an anti-atherogenic endproduct in the heme-oxygenase-carbon monoxide signalling pathway.9,34,35,36 The cytoprotective properties of bilirubin may be explained by its inhibition of protein kinase C (PKC).10,11,37 Haeme oxygenase activity is related to a faster resolution of inflammation, whereas the inhibition of this enzyme seems to increase the inflammatory response.38

Familial aggregation and linkage analysis for high bilirubin levels

Familial aggregation of bilirubin levels was supported by recent segregation analysis on 1240 adults in 84 Utah pedigrees suggesting a major gene controlling bilirubin levels.13 The parameters obtained from the segregation model were used in the linkage analysis. It was estimated that the major gene was responsible for elevated bilirubin levels in 12% (q=0.34) of the subjects explaining 27% of the variance in bilirubin levels.13 An ascertainment correction was not possible because the CAD proband founders of the pedigrees were deceased. Therefore our estimated gene frequency for high, protective bilirubin levels might be lower than in a random population since these CAD pedigrees might have lower rather than elevated bilirubin levels. A segregation model for bilirubin in the Family Heart Study whose families were ascertained half at random and half for familial CAD and in which ascertainment correction was possible showed a higher gene frequency (q=0.48) but otherwise very similar parameters.14 The gene frequency estimate was even lower in the current set of pedigrees probably because there was an even greater selection of high-risk CAD families. Using the parameter estimates from the Utah pedigrees,13 we found three chromosomal regions that were linked to high bilirubin levels and a further region when we used model-free NPL statistics. The most important region is on chromosome 2q34–37 including marker D2S1363 (NPL score=3.22). This marker is approximately 12 cM or 8 Mb from an important candidate gene for high bilirubin levels which is uridine diphosphate glycosyltransferase 1 (UGT1A1).39 Mutations in this gene are responsible for Crigler-Najjar syndrome I40,41,42 resulting in an absence of enzymatic bilirubin glucoronidation and Crigler-Najjar syndrome II43,44 with a severe deficiency of the enzyme. Both syndromes are very rare (1 in 1 million births) and are often but not always accompanied by a mild hyperbilirubinemia. A further syndrome with mild unconjugated hyperbilirubinemia is Gilbert syndrome which was suggested to have a frequency between 2 and 12%.45,46,47,48 One of the most important polymorphisms responsible for this syndrome is a polymorphism in the promoter of the UGT1A1 gene.49 Promoters containing seven thymine adenine (TA) repeats have been reported to be less active than the wild-type six repeats, and the serum bilirubin levels of persons homozygous for seven repeats have been found to be higher than those with the wild-type six repeats.49,50,51 Bosma et al. reported a frequency of the seven repeat allele of about 40%49 which is very close to the results from the previous segregation analysis in Utah pedigrees13 and in the NHLBI Family Heart Study.14 Whether this gene and/or other genes close to this gene are responsible for the linkage results for the high bilirubin levels remains to be determined in future studies. Interestingly, a recent study found a significantly lower prevalence rate of ischaemic heart disease in Gilbert syndrome when compared to a general population.52 This is in line with the inverse association between bilirubin and atherosclerosis found in this and several other studies.12,13,14,15,16,17,1819 Since NADH is involved in the metabolism of bilirubin, other candidates in this region are proteins and subcomplexes of the NADH dehydrogenase (NDUFS1, NDUFA10 and NDUFB3). Furthermore, CYP27A1 codes for the steroid 27-hydroxylase and is a cytochrome P450 enzyme of bile acid synthesis.53

The other regions suggestive for linkage are located on chromosomes 9q21, 10q25–26 and 18q12. No obvious candidate genes are available in these regions.

We repeated the analysis for chromosome 2, 9, 10 and 18 by using parameters obtained from segregation analysis of the pedigrees used for linkage analysis. Even if this analysis favoured a codominant rather than a recessive model, we observed for chromosomes 2, 9 and 18 very similar LOD scores. Parametric (LOD scores) and non-parametric (NPL scores) linkage analysis gave virtually the same results for chromosome 2. The discordance between methods for other chromosomes might correspond to mis-specifications of the parametric model or reduced power of the nonparametric method.

Methodological implications

Despite initial enthusiasm about the concept of using linkage to identify genes related to common, multifactorial diseases or phenotypes, the results of many genome searches have been somewhat disappointing. The underlying heterogeneity of the measured phenotype and/or the small effects of any single gene on the phenotype may greatly reduce the power of linkage to identify such genes. This paper shows that there is a likelihood of success in identifying genes related to some phenotypes by linkage analysis. Two prior studies identified evidence for segregation of a major gene for bilirubin. This study shows that this major gene likely is located on chromosome 2. Polymorphisms in the promoter of the UGT1A1 gene, the most likely candidate within this region, are known to be associated with Gilbert syndrome causing hyperbilirubinemia and the associated gene frequency appears to be very close to the estimates from segregation analyses. These results and results from other linkage studies54,55 suggest that a linkage approach, especially in large pedigrees, may still be successful.

Conclusions

These results provide evidence that loci influencing bilirubin variation exist on chromosomes 2q34–37, 9q21, 10q25–26, and 18q12 and confirms the association of low bilirubin levels with coronary artery disease in males. Future studies have to narrow the region and should study the association of polymorphisms in the promoter of the UGT1A1 gene and CAD.