Introduction

The concentrations of high-density-lipoprotein cholesterol (HDL-C) have been demonstrated to be negatively associated with the risk for coronary heart disease (CHD).1,2,3,4,5 Besides environmental and behavior factors, a genetic influence on HDL-C levels has been suggested. Several studies described familial aggregation of HDL-C.6,7,8,9,10,11,12,13 The correlation between the midparent and the child levels of HDL-C was found to be about 0.40 in an investigation in adolescent children living in the same household as their parents.9 A study in 139 families consisting of the parents and one child reported a correlation of 0.36 for HDL-C between the average parental and the child value.10 Similar correlations were reported in the Heritage Family Study.12 Weak parent–child correlations, however, were described in the Cuenca Study.13 Results from the NHLBI Family Heart Study (FHS) observed in randomly selected families a correlation of 0.32 and 0.29 for parent–child and sibling–sibling HDL-C, respectively, but less than 10% of this correlation was explained by lifestyle habits such as alcohol consumption, physical exercise and smoking.11 This is in accordance with findings in dizygotic twins showing a correlation of 0.39 for HDL-C, which was only slightly reduced after adjustment for lifestyle and other environmental variables.14 Furthermore, the genetic heritability was calculated to be 0.485±0.029 in data from the Lipid Research Clinics Program Family Study.15 While heritability for plasma HDL-C levels appears to be high, efforts to fit genetic models with complex segregation analysis have led to conflicting results. Some found no evidence for a major gene,16,17,18,19 others found evidence for major genes with various modes of transmission20,21,22,23,24,25,26 (for details see Table 1). We therefore analysed the segregation of HDL-C in the NHLBI FHS as well as in a separate set of Utah pedigrees.

Table 1 Studies investigating the segregation of HDL-C

Subjects and methods

Population

This analysis is based on the subjects recruited for the FHS, a population-based, multicenter study aimed to identify genetic and nongenetic determinants of cardiovascular disease. Probands were identified from three parent cohort studies, the Forsyth, (NC) and Minneapolis, (MN) cohorts of the ARIC study, the Framingham Heart Study Offspring Cohort, and the Utah Family Tree Study. Extended information about design and objectives of FHS is given elsewhere.27

In Phase I of FHS, 2000 randomly selected population-based participants and 2000 with family histories of CHD (about 500 from each group and each of the field centres) were identified among 14 592 middle-aged participants in epidemiological studies. Medical histories from these individuals, their parents and siblings, were used to calculate family risk scores which compared the number of reported and validated CHD events with the number expected based on the size, sex and age of family members. In phase II of the study, a total of 661 families with the highest risk scores and early onset of CHD, and 592 randomly sampled families (about 150 from each group and each of the field centres) had a detailed clinical and laboratory investigation including interviews by experienced interviewers. Clinical data obtained included anthropometry, blood pressure, blood samples for laboratory tests, electrocardiogram, pulmonary function, ultrasound of carotid arteries, and questionnaires supplying information about socio-economic status, lifestyle, medical history, medication use, food intake, nicotine and alcohol consumption, physical exercise, family structure, and psychosocial characteristics.

The analyses were repeated using data from 2013 individuals in 85 Utah extended pedigrees. The majority of these subjects (most within large pedigrees) were ascertained through sibships with two or more coronary heart disease deaths before the age of 55 (18 pedigrees), or stroke deaths before the age of 75 (9 pedigrees). These families were selected through a statewide file of 140 000 death certificates computer-linked to genealogical records for 1.2 million people in Utah. The rest of the sample (most within small pedigrees) was ascertained through probands selected randomly from participants of the Utah center for the Hypertension Detection and Follow-up Program. Detailed characteristics of the sample are described elsewhere.28 The subjects in this sample are 98% Caucasian; analyses were not performed on the small number of non-Caucasian subjects. There was no overlap between these families and the sample of families from Utah recruited for the FHS study.

Measurement of HDL-C and other phenotypes

After fasting for 12 h, blood samples were collected from FHS subjects in tubes with no additives. After low-speed centrifugation, serum samples were stored at −70°C until sufficient numbers of samples accumulated for shipment to the FHS Central Biochemistry Laboratory at the University of Minnesota for processing. HDL-C was measured in the supernatant after dextran sulphate-magnesium precipitation using enzymatic methods.29,30 Samples from Utah subjects were processed at the University of Utah Cardiovascular Genetics Program using similar methods.14 Anthropometric measurements were collected with subjects wearing scrub suits. Weight was measured using a balance scale, and height was measured using a vertical ruler mounted to a wall. Subjects were asked to bring medications to their clinic visit, and medication use was assessed through a review of these medications and through an interview. All other variables were collected through interviews performed by trained interviewers.

Statistical analysis

We adjusted HDL-C levels within gender for age, amount of alcohol consumption, current smoking status, body mass index, waist/hip ratio, amount of TV watching as a measure of physical inactivity, estrogen substitution for postmenopausal reasons, menopausal status, and recruitment centre by the SAS GLM procedure.31 Adjusted HDL-C values for segregation analysis were available from 560 randomly recruited Caucasian families (1896 subjects) and from 522 Caucasian families (1859 subjects) with high family risk score for CHD.

In the Utah sample, the same covariates were used with the exception of waist/hip ratio, TV watching, and recruitment centre. Waist circumference, physical activity by questionnaire, and watts needed to maintain 80% of maximum heart rate from a 4-minute bicycle test were used in the Utah HDL-C level adjustment. These adjustments were performed to ensure that a major gene effect was not disguised nor explained by effects of covariates. Other variables were not included in the final analysis after it was shown that they did not contribute to the regression model. Corrections for covariates were done separately within the FHS and Utah samples. Adjusted HDL-C levels were calculated by adding the overall mean HDL-C concentration to the calculated residuals of each individual.

Segregation analysis of adjusted HDL-C from the FHS sample was performed in the whole group of families, in the random group, and in the CHD group. For the Utah sample, the entire sample was analysed together. Maximum likelihood segregation analysis32,33 was done using the Pedigree Analysis Package (PAP) software.34 This analysis was performed before adjustment of HDL-C for log transformed triglyceride concentrations and again after adjustment. Adjustment for the above variables explained 22% of the HDL-C values in FHS females (11% in Utah females) and 16% in FHS males (12% in Utah males) when triglycerides were not considered. Inclusion of triglycerides in the model increased the sex-specific variance explained to 31% (19% for Utah) and 32% (18% for Utah), respectively. A standard segregation model was tested, in which we assumed the variation in HDL-C was caused by the sum of the independent effects of alleles L and H at a major locus, polygenic inheritance, and random environmental factors specific to each individual. We assumed that the major locus alleles were in Hardy–Weinberg equilibrium. Three HDL-C phenotypic means were defined, one for each genotype (μLL, μLH, and μHH).

Model fitting was performed searching both for an allele at a major gene locus associated with lower than average HDL-C levels as well as for an allele associated with higher than average HDL-C levels. In models investigating low HDL-C, μHH represents a mean close to the average HDL-C levels in the pedigree members and μLL represents the low HDL-C mean. When searching for evidence of an allele that raises HDL-C, μHH represents the mean of higher than average HDL-C levels. Several starting values for the three means were used. As a general rule we decreased the starting values of μLL and μLH in steps of 2 to 5 mg/dl when fitting a low HDL-C model and increased μLH and μHH when fitting a high HDL-C model. The gene frequency q is the frequency of the allele associated with the mean furthest from the average HDL-C level in the low and high HDL-C models. Homoscedasticity was assumed by defining one common parameter (σ) for the standard deviation of each genotypic category. Note that the variation within a genotypic category is divided into a polygenic component and random environmental effects specific to each individual. These effects were assumed to be normally distributed. Three transmission probabilities (τ1, τ2, and τ3) were also defined. The transmission probabilities reflect the chance that a parent will transmit a specific allele to an offspring (1, 1/2, and 0 for Mendelian transmission).

The GEMINI function35 was used to produce maximum-likelihood estimates of the parameters. Sub-models were compared to a more general model. The difference in −2×ln(likelihood) of each model is approximated by a χ2 distribution, with degrees of freedom equal to the difference in the number of parameters estimated in the two models. Tests were performed as follows: first, the most general model with the three τi estimated was generated. Both a Mendelian model and an environmental model were compared to this general model to test the transmission pattern. If the Mendelian model was accepted and the environmental model was rejected, dominant and recessive models were compared with the codominant model to determine the most likely mode of inheritance.

Ascertainment correction in the FHS group recruited for early onset CHD was performed for the youngest family member with coronary artery disease or for the proband if a family member with CHD was not available for examination. This correction was performed independently whether the CHD families were analysed as a separate group or whether they were analysed together with the random families. Within PAP, the likelihood of the pedigrees given the model parameters is divided by the likelihood of the ascertainment subset to produce the final corrected likelihood. Correction for ascertainment bias will minimize the risk of making false claims of a major gene effect.36 Analysis in the FHS group was also done without ascertainment correction which showed virtually the same results. Ascertainment corrections of the Utah pedigree data could not be done since ascertainment was on unexamined deceased founders of the extended pedigrees.

Results

Table 2 presents the variables used in adjusting HDL-C for the FHS data. Model fitting was performed separately for males and females. In the model which included triglycerides, HDL-C increased in both genders with increasing alcohol consumption, physical activity, and decreased with age, body mass index, waist/hip ratio, abdominal girth, triglycerides, smoking, and in males with increasing TV watching time. In females postmenopausal status was associated with lower and estrogen use with higher HDL-C values. The distribution of HDL-C after adjustment had a skewness of 0.73 and a kurtosis of 1.07. Adjustment in the Utah data yielded similar results (data not shown).

Table 2 Adjustment of variables influencing HDL-C concentrations in the whole FHS study population. The sex-specific variance explained 31 and 32% of HDL-C values in females and males, respectively

Modelling of an allele for low HDL-C in FHS families

HDL-C without adjustment for triglyceride concentra- tions

Independently of how we set the initial estimates in the search for a locus for low HDL-C (e.g. μHH=50 mg/dl, μLH=48 mg/dl, μLL=38 mg/dl), μLL changed to a high value above 70 mg/dl. In fact, μLL, μLH, and μHH changed to exactly the same values as we found them when using starting estimates favouring an allele that raises HDL-C above average (see below). There was no bimodality initially evident for a mean representing low values. Therefore we decided to omit those individuals with the upper 10% of HDL-C to see if either some outliers or possibly another allele related to high HDL-C caused this phenomenon. Although there was now significant bimodality for a low HDL-C mean in the data, we did not find any evidence for an allele responsible for low HDL-C levels (P<0.0001, P<0.0001, and P<0.005 rejecting the Mendelian vs the general model for all families, random families and the high-risk CHD families, respectively) (Table 3).

Table 3 Gene frequencies (q) and means (μLL, μLH, and μHH) for the different genetic models calculated for the whole, the random and the CHD group of FHS families. Results are given in the order q/μLLLHHH

When testing for low HDL-C levels before adjustment for triglyceride concentrations, the best model in all three FHS groups (whole, random and CHD group) was the environmental model (P=0.74, P=0.19 and P=0.53, respectively) (Table 3). The environmental model parameter estimates for the whole group were as follows: q=0.29, μHH=43.7, μLH=55.0, μLL=33.6, σ=6.10 and h2=0.91.

HDL-C adjusted for triglyceride concentrations

When we reran the analysis after additional adjustment of HDL-C for triglycerides, both the environmental and major gene models were rejected and we accepted only the general model (Table 3). The general model parameter estimates for the whole group were as follows: q=0.35, μHH=53.3, μLH=43.7, μLL=34.8, σ=5.74 and h2=0.21.

Modelling of an allele for high HDL-C in FHS fam- ilies

Evidence for segregation when modeling a mean for high HDL-C in the FHS families was further examined using all individuals without exclusion of subjects above or below a certain percentile. In the whole group as well as in the random group, the Mendelian and the environmental models were rejected independently of whether we adjusted for triglycerides or not, although the codominant model fitted better than the environmental model (Table 3). The general model also fitted the best for the CHD group when HDL-C was not adjusted for triglycerides. After adjustment, however, it was no longer possible to reject the codominant and the recessive model in the CHD group (Table 4). In both models, the means of the homozygote allele carriers and those without the allele were separated by about 25 mg/dl HDL-C (74.3 vs. 47.6 mg/dl for the codominant model and 73.9 vs. 48.2 mg/dl for the recessive model). The gene frequencies (q) for high HDL-C were 0.23 and 0.24 for the codominant and the recessive models, respectively. Interestingly, the mean values of the different genotypes as well as the gene frequencies (Table 3) were similar when compared to the random group of families, considering that the random group should generally have higher mean HDL-C values. The parameter estimates for the codominant model of the random group were q=0.19, μLL=48.1, μLH=53.1, μHH=82.1, σ=9.71, h2=0.53 and for the recessive model q=0.19, μLL=49.7, μLH and μHH=81.2, σ=9.99 and h2=0.56. Only the transmission probabilities and especially τ3 were markedly different between the CHD, random and whole group of families: τ3=0.25, 0.007 and fixed to 0, respectively.

Table 4 Genetic model parameters and estimates for high HDL-C levels (after adjustment for triglycerides) in the CHD group of FHS families

Segregation in the Utah pedigrees

As with the FHS data, the Utah data produced estimates indicating a higher than average mean for HDL-C regardless of starting values. Models using HDL-C not adjusted for triglyceride resulted in rejection of both the environmental and codominant models (P<0.0001; P=0.046, respectively). For HDL-C adjusted for triglycerides, support for a major gene was found (Table 5). The environmental model was rejected (P<0.0001), but the major gene model was not rejected (P=0.22 and P=0.054 for the codominant and the recessive models, respectively). In comparison with the general model, the dominant model could be rejected (P<0.0001). Parameter estimates for the best-fitting codominant model are remarkably similar to those found in the FHS data: q=0.26, μLL=45.8, μLH=49.0, μHH=67.5, σ=8.09, h2=0.54, with a separation of about 22 mg/dl HDL-C between the two homozygote means.

Table 5 Genetic model parameters for analyses of HDL-C levels (after adjustment for triglycerides) in the Utah extended pedigrees

Discussion

There is clear evidence of a high heritability of HDL-C concentrations.6,910,11,12,13,15 The mode of inheritance, however, is controversial, as can be seen in Table 1. Friedlander et al studied 3074 nuclear families with 1–2 children and found evidence for a recessive major gene for low HDL-C.23 The analysis remained inconclusive for high HDL-C in the same study population.24 Bucher et al37 proposed a factor for low HDL-C levels transmitted from parents to offspring in a large sample of 1806 families with nearly 7000 members. It was unclear whether this transmission followed a Mendelian pattern. Byard et al described a recessive major gene for low HDL-C concentrations.20 A study in 526 Mexican American individuals in 25 randomly ascertained families reported evidence for a major locus for low HDL-C with a codominant model.38 Others reported either evidence for a major gene with unknown mode of transmission21,22,25 or no evidence for a major gene locus.16,18,19 The reasons for these divergent results might be found in differences in study design, differences in ethnic compositions of study populations, recruitment conditions and data adjustment.

Although we investigated a large data set of families we did not find any evidence for a major gene allele that lowers HDL-C concentrations below average, independently of whether we adjusted for triglyceride concentrations or not. Bimodality for low HDL-C levels was not observed unless the highest 10% of HDL-C values were excluded. We accepted only the environmental model for low HDL-C before adjustment for triglycerides. The rejection of the environmental model after adjustment for triglycerides is in accordance with the inverse relation between triglyceride and HDL-C concentrations and the pronounced environmental influence on triglyceride levels.

It is surprising at first glance that we found evidence for a major allele leading to higher than average HDL-C values only in the group of families recruited for clustering of CHD, the group one would least expect to find it. We observed a recessive or codominant transmission of the inferred gene after adjusting for the effects of triglycerides on HDL-C. The mean separation and standard deviation of the different HDL-C genotype distributions as well as the gene frequencies were similar in the random and the whole group of families when compared to the CHD group. The major differences were found in the transmission probabilities which suggests that the models between the groups of families are very similar from the physiological point of view. A more pronounced confounding and/or heterogeneity in the random group of families resulting in different transmission probabilities could be an explanation for these findings. In other words, the random and CHD models for high HDL-C are very similar resulting in mild evidence for a high HDL-C allele in the entire dataset. There appears, however, to be some other factor that may be confounding the segregation in both groups but is just a little stronger in the random group (or by chance appears to be stronger).

Although we used a large number of families, these somewhat small family structures might have limited our analysis providing poor estimates of the transmission probabilities. The ascertainment correction for the FHS CHD families may not have been correct, given the complicated selection of these families. However, the correction made virtually no difference in the results when compared to an analysis without ascertainment correction (data not shown). To test for reproducibility, we obtained similar results from a set of large extended Utah pedigrees ascertained for early death due to stroke or CHD, or due to early hypertension. These similar results in an independent sample with greater segregation information support the evidence for a major allele for high HDL-C in subjects ascertained for CHD risk.

Conclusion

It appears that there is little or no evidence for a major gene allele for lower than average HDL-C levels in these data sets and that an environmental model fits best for low HDL-C. Surprisingly, the model that showed the most evidence for an allele leading to high HDL-C occurred in the CHD group, supported by similar results obtained in a Utah sample ascertained for risk of CHD and related phenotypes.