Abstract
A scan of 4,190 single-nucleotide polymorphisms (SNPs) in 199 different genes using 38 different quantitative traits to search for associations between genotypes and phenotypes detected an association between the genotypes at rs671 of ALDH2 and gamma-glutamyltranspeptidase (gamma-GTP) levels. We examined the associations between five factors such as gender, age, rs671 genotype, alcohol-drinking habit, and serum gamma-GTP level and found that all pairs were associated except for the pair of rs671 genotype and gender and rs671 genotype and age. We further analyzed the data by both multiple regression and subgroup analyses and found that the associations between rs671 genotype and alcohol-drinking habit, alcohol-drinking habit and gamma-GTP level, gender and gamma-GTP, and age and gamma-GTP were independent of other factors. Conversely, the association between rs671 genotype and gamma-GTP level was dependent on alcohol-drinking habit. Associations between genetic and environmental factors will become a focus of medical and biological studies. Our study has shown that (1) a large sample size combined with a replication study is necessary to overcome the multiple-comparison problem, and (2) subgroup analysis along with logistic and linear multiple regression analysis may be useful to dissect a complicated relationship.
Similar content being viewed by others
Introduction
Due to the availability of accurate genotyping systems for thousands or millions of single-nucleotide polymorphisms (SNPs), genetic association studies are now a focus of medical research (Kamatani et al. 2004; Rosalki 1973, 1974; Boone et al. 1977; Spencer-Peet et al. 1975; Teschke et al. 1977; Impraim et al. 1982). One problem in using thousands of SNPs is multiple comparisons. Thus, if the test is performed using thousands of SNPs, a certain number of SNPs show very low P values, even if they are not truly associated with the traits. To overcome the problem of multiple testing, such methods as Bonferroni’s correction are used; however, in this case, extremely low P values are required for SNPs to be truly associated with the traits.
Another problem in association studies for multifactorial diseases is relatively small effect sizes. Unlike Mendelian traits, the effects of genotypes on phenotypes are not large in such diseases. To cope with the problem of small effect size as well as multiple comparisons, a large sample size is necessary.
An additional issue to consider in the study of multifactorial traits is the effect of nongenetic factors. For example, gender, age, habit, diet, and exposure to chemicals may be associated with the development of phenotypes. In some cases, such nongenetic factors may affect each other or may interact with genetic factors for the development of phenotypes leading to a very complicated structure of interactions. In this investigation, we show such an example and a potential method to dissect the complicated structure of the associations.
Enzymes and transporters possibly related to transportation, metabolism, and action of chemical agents may be associated with either efficacy or adverse drug interactions. In a previous study, we constructed a database of SNPs and haplotypes for 4,190 SNPs in 199 different genes using blood samples from 752 Japanese subjects (Kamatani et al. 2004). For those subjects, 38 different quantitative traits were available. Some of these traits may be associated with either genotypes or diplotype configurations (combination of two haplotypes) in some genes independently or with interactions. We performed an extensive analysis to search for such associations using all available genomic variations and traits. Through such an analysis, we observed a strong association between SNPs of aldehyde dehydrogenase 2 (ALDH2) and serum gamma-glutamyltranspeptidase (gamma-GTP) levels. Serum gamma-GTP elevates after alcohol ingestion (Rosalki 1973, 1974; Boone et al. 1977; Spencer-Peet et al. 1975; Teschke et al. 1977), and a polymorphism (rs671) in ALDH2 is associated with individual tolerance to alcohol (Impraim et al. 1982; Goedde et al. 1986, 1992; Crabb et al. 1989; Shibuya et al. 1989). In fact, gamma-GTP levels were associated with alcohol-drinking habit and gender in addition to the rs671 SNP genotype in our data. In this study, we analyzed data to dissect the complicated structure of associations between genomic variations in ALDH2, alcohol-drinking habit, gender, age, and serum gamma-GTP levels to establish an effective method to investigate associations of genomic variations with both qualitative and quantitative traits.
Materials and methods
Subjects
New samples were not collected; data collected in a previous study (Kamatani et al. 2004) were used. From that study, genotype data of 4,190 SNPs in 199 different genes from 752 Japanese subjects were available. Those SNPs had been found from genes coding for enzymes and transporters possibly related to transportation, metabolism, and action of chemical agents (Kamatani et al. 2004). SNPs within the same genes are often in linkage disequilibrium (LD) with each other, and the results of the LD analysis have been described elsewhere (Kamatani et al. 2004). For those subjects, 38 different quantitative traits including serum gamma-GTP levels were obtained. Informed consents had been obtained from all subjects for using their DNA for any association studies, and the study had been approved by the institutional ethical committee.
Genotyping
Genotyping was performed by the Invader system (Ohnishi et al. 2001). Methods for the genotype data collection have been described previously (Kamatani et al. 2004).
Scan of the associations between SNPs and quantitative phenotypes
The computer program to scan all 4,190 SNPs and 38 different quantitative traits to find associations was made with C language. All other calculations and statistical analyses were performed in R.
SNP information
SNP information such as order and map position was obtained from the National Center for Biotechnology Information (NCBI) database (http://www.ncbi.nlm.nih.gov/). LD parameters D′ and r 2 were calculated using PENHAPLO and QTLHAPLO programs (Ito et al. 2004; Shibata et al. 2004).
Statistical analysis
Association between genotypes and phenotypes was performed by the single-locus analysis only. No haplotype-based association tests were performed. Since most of the 38 quantitative variables were not normally distributed, nonparametric methods were used. To compare quantitative variables in two different groups, the Mann–Whitney U test was used. To compare the same variables in more than two different groups, the Kruskal–Wallis test was used. To test the correlation between two quantitative variables, Spearman’s rank correlation coefficient was used. When the correlation between age and gamma-GTP level was examined, the parametric method after the logarithmic conversion of gamma-GTP level was used. All statistical tests and inferences, including logistic multiple regression analysis and linear multiple regression analysis, were performed in the R environment.
Results
Scan of associations between genotypes and quantitative traits
The test of association using each of the 4,190 SNPs and each of the 38 different quantitative traits was performed using the Mann–Whitneys test. For each SNP, two different comparisons were made: between one homozygote type and the rest of the subjects and between the other homozygote type and the rest of the subjects. When the SNP consisted of A and G alleles, for example, the subjects with the AA genotype were compared with those with the AG or GG genotype for one comparison, whereas the subjects with GG were compared with those with the AA or AG genotype for the other.
It was found that P values were among the lowest for the association between rs4646776 and rs671 SNPs within ALDH2 and serum gamma-GTP levels (Table 1). For rs671, the subjects with the GG genotype [n = 436, median 24.5, mean 39.8, standard deviation (SD) 42.0] had significantly higher gamma-GTP levels than those with the AA or AG genotype (n = 314, median 21.0, mean 31.2, SD 31.4) (P = 0.000299, Mann–Whitneys U test) (Table 2). The comparison between subjects with the AA genotype and those with the AG or GG genotype (AG + GG) at the same SNP showed a higher P value of 0.0493 (Table 2).
Linkage disequilibrium analysis of rs671 SNP
Closely located SNPs are often in strong linkage disequilibrium with each other. We determined the LD parameters between pairs of nine SNPs including rs671, which is the critical SNP for alcohol intolerance (Impraim et al. 1982; Goedde et al. 1986, 1992; Crabb et al. 1989; Shibuya et al. 1989). Table 1 shows that D′ values between rs671 and each of the other eight SNPs were either 1.0 or very close to 1, thereby indicating that rs671 was in almost complete LD (Feldman et al. 1974) with these SNPs. On the other hand, r 2 values between rs671 and rs4646776 but not between the other SNPs were close to 1.0. These data indicate that rs671 is almost in absolute LD (Walter and Cox 1988; Kasperczyk et al. 1990; Chehab et al. 1991) with rs4646776 but not with the other SNPs. The physical distance between rs671 and rs4646776 is about 11.7 kb, and it is not surprising that the two SNPs in this distance are in absolute LD.
If two SNPs are in absolute LD, it is expected that one of them can represent the other almost completely. As expected, rs4646776 but not the other SNPs was strongly associated with gamma-GTP level (Table 1). When gamma-GTP levels were compared between different genotypes at rs4646776, the difference was statistically significant by the Kruskal–Wallis test (P = 0.00105) (Table 1). When gamma-GTP levels were compared between the genotypes CC + CG and GG, they were significantly different (P = 0.000321, Mann–Whitney test).
Association between the genotype at rs671 and gamma-GTP level
SNP rs671 has been known to affect ALDH2 enzyme function (Impraim et al. 1982). Subjects with the AG genotype have a higher sensitivity to alcohol than those with the GG genotype, whereas those with the AA genotype are even more sensitive than those with the AG genotype (Impraim et al. 1982; Goedde et al. 1986, 1992; Crabb et al. 1989; Shibuya et al. 1989).
Although the P value of 0.000299 is quite low compared with the regular significance level of 0.05, it is much higher than the significance level required after Bonferroni’s correction for multiple comparisons. Since the test was performed for 4,190 SNPs and 38 quantitative traits with two different modes, the significance level required after Bonferroni’s correction will be 0.05/(4,190 × 38 × 2) = 1.57 × 10−7.
Next, gamma-GTP levels were compared between three different genotypes at rs671 SNP. Information was available for 750 subjects. Comparison between the gamma-GTP levels of the subjects with AA (n = 50), AG (n = 264), and GG (n = 436) genotypes at rs671 are shown by box plots in Fig. 1. Medians, means, and standard deviations (SD) of gamma-GTP levels with the three different genotypes as well as the combined genotypes (AA + AG and AG + GG) are shown in Table 2. The gamma-GTP levels in the three genotypes AA, AG, and GG were significantly different when tested by the Kruskal–Wallis test (P = 0.000988, Table 2), but the P value was still insufficient to reach the threshold of significance after Bonferroni’s correction (3.14 × 10−7). Data in Table 2 and Fig. 1 indicate that there is a gene dosage effect in the association between the rs671 SNP and gamma-GTP levels; i.e., this quantitative trait increases with an increasing dose of the G allele.
Association between each pair of five factors
Although the association between the rs671 genotype and gamma-GTP level was extracted from the association study in which multiple SNPs and phenotypes were included, the precise relationship between various factors should be analyzed with care. Previous studies have indicated the cause-and-effect relationship between rs671 genotype and alcohol-drinking habit (Impraim et al. 1982) and between alcohol-drinking habit and gamma-GTP level (Boone et al. 1977). However, other factors, including gender and age, are also likely to be involved.
In this study, we examined the structure of the relationship between two qualitative variables (gender and alcohol-drinking habit), two quantitative variables (age and gamma-GTP level), and the rs671 genotype (Fig. 2). Gender, alcohol-drinking habit, and the rs671 genotype can be interpreted as discrete random variables, whereas both age and gamma-GTP level can be interpreted as continuous random variables. Neither gamma-GTP level nor age is normally distributed, and the application of parametric methods is invalid. We first examined the association between each pair of five factors in Fig. 2. We had already obtained evidence of the association between the rs671 genotype (i.e., number of G alleles in the genotype) and gamma-GTP level (Figs. 1, 2a; Table 2) by nonparametric methods.
We next examined the association between gender and gamma-GTP level (Fig. 2b). Male subjects had significantly higher gamma-GTP levels than female subjects (Fig. 3a) (P < 2.2 × 10−16, Mann–Whitney test). We then examined the association between gender and rs671 genotype (Fig. 2c). Table 3 shows the number of genotypes in male and female subjects. There was no association between gender and rs671 genotype (chi-square test with 2 df, P = 0.202). Therefore, as expected from inheritance laws, the rs671 genotype was independent of gender (Fig. 2c).
We then examined the association between gender and alcohol-drinking habit (Fig. 2d). The subjects were categorized based on their answers to the question “Do you drink alcohol”? The three possible answers were 0 “no”; 1 “yes, periodically”; and 2 “yes, almost every day”. The question was answered by 734 subjects. Table 4 shows the subjects categorized by gender and their answers to the question. Gender was strongly associated with alcohol-drinking habit (Table 4) (P = 1.83 × 10−12, chi-square test with 2 df). Males were categorized as everyday drinkers more frequently than were female subjects [risk ratio (RR) 2.03, 95% confidence interval (CI) 1.64–2.50].
We then examined the association between alcohol-drinking habit and the rs671 genotype (Fig. 2e). The values for both variables were available from 732 subjects. Table 5 shows the association between the habit of alcohol ingestion and rs671 genotypes by a 3 × 3 contingency table. The proportions of subjects who drank alcohol (periodically or almost everyday) increased significantly with increasing numbers of G allele in the genotype (Table 5) (P < 2.2 × 10−16, chi-square test with 2 df). When the risk of everyday drinking was compared between different rs671 genotypes, the probability of becoming everyday drinkers was 4.69-fold (RR 95% CI 2.47–8.88, P = 1.22 × 10−15, chi-square test with 1 df) as high for subjects with the AG genotype as those with the AA genotype, whereas it was 5.36-fold (RR 95%CI 2.84–10.1, P = 5.18 × 10−30, chi-square test with 1 df) as high as those with the GG genotype (calculated from Table 5).
Then, we examined the association between alcohol-drinking habit and gamma-GTP level (Fig. 2f) from 734 subjects. Gamma-GTP levels were much higher in everyday drinkers than in periodic drinkers, and they were much higher in periodic drinkers than in nondrinkers (Fig. 4a) (P < 2.2 × 10−16, Kruskal–Wallis test). These results indicate that the alcohol-drinking habit was strongly associated with gamma-GTP level (Fig. 2f).
We then examined the association between age and alcohol-drinking habit. Ages of everyday drinkers (mean 42.1, SD 11.1, n = 312) were higher than those of nondrinkers (mean 37.4, SD 11.8, n = 166, P = 2.80 × 10−6, Mann–Whitney U test) and those of periodic drinkers (mean 35.6, SD 10.8, n = 256, P = 6.55 × 10−13, Mann–Whitney U test) (Kruskal–Wallis test, P = 6.66 × 10−13). These results indicate that age is clearly associated with alcohol-drinking habit (Fig. 2g). In addition, age was positively correlated with gamma-GTP levels (Spearman’s rank correlation test, P < 2.2 × 10−16, r = 0.408, n = 752), thereby showing that age is associated with gamma-GTP level (Fig. 2h). The ages of male subjects (n = 449, mean 40.5, SD 11.4) were significantly (Mann–Whitney U test, P = 1.22 × 10−9) higher than female subjects (n = 303, mean 36.1, SD 11.5). Therefore, age and gender are associated with each other in our study (Fig. 2i). Then, we compared the ages between different rs671 genotypes. The ages in the three different genotypes were not significantly different (Kruskal–Wallis test, P = 0.5), thereby indicating that age and rs671 genotype are not associated with each other (Fig. 2j).
In summary, we have shown clear associations between all pairs of five factors in Fig. 2 except for rs671 genotype and gender and rs671 genotype and age. However, the structure of the relationship in Fig. 2 is very complicated and needs further investigation.
Multiple regression analysis
The first step to study the effects of various factors on a variable is multiple regression analysis. First, logistic multiple regression analysis was performed by setting alcohol-drinking habit as the criterion variable. In this binary-type analysis, alcohol-drinking habit was categorized as 0 (nondrinker or periodic drinker) or 1 (everyday drinker). The explanatory variables were age, gender, and rs671 genotype. Gamma-GTP level was not used because it does not likely affect the alcohol-drinking habit. Gender was coded as 1 (male) and 2 (female), and rs671 genotype was coded as -1 (AA), 0 (AG) and 1 (GG). Note that the explanatory variables age, gender, and rs671 genotype are theoretically independent from each other, although there was an association between age and gender (Fig. 2). Table 6 shows the results of the logistic multiple regression analysis. As shown, elder age, male gender, and G allele at rs671 significantly increase the probability of everyday drinking. P values were the lowest for the rs671 genotype, followed by gender and age. These results were as expected and the interpretation was straightforward.
In the next step, data was analyzed by linear multiple regression analysis with log gamma-GTP level as the criterion variable. In fact, log gamma-GTP level does not necessarily fit a normal distribution; however, it is much better than raw gamma-GTP level (data not shown). Explanatory variables were age, gender, alcohol-drinking habit, and rs671 genotype. Gender was coded as 1 (male) and 2 (female), whereas alcohol-drinking habit was coded as nondrinker (0), periodic drinker (1), and everyday drinker (2). The rs671 genotype was coded as −1 (AA), 0 (AG), and 1 (GG). There is a limitation in this analysis because explanatory variables are dependent on each other. Table 7 shows the results of this analysis. Although the results suggest that gender, age, alcohol-drinking habit, and rs671 genotype are associated with gamma-GTP level, the interpretation is not easy, because explanatory variables are associated with each other in a complicated manner (Fig. 2).
Subgroup analysis to examine the dependence of the association on other factors
Although investigation of the relationship between various factors as in Fig. 2 is not easy, a straightforward method is to examine the association in a rather homogeneous subgroup. One may stratify the entire sample into subgroups according to categorical variables until the subgroup populations become rather homogenous. However, the problem is that one cannot classify the sample according to a continuous variable and that the final subgroups may be too small for statistical analysis. By taking advantage of the large size of our sample, we attempted to perform association studies using subdivided samples. There were three categorical variables (gender, alcohol-drinking habit, and rs671 genotype). If we used all three categorical variables, we could divide the entire sample into 18 subgroups. However, the test of association using each of the 18 subgroups was not realistic. We may be able to ignore the effect of some factors because of the evidence of independence or very little dependence.
Association between rs671 genotype and alcohol-drinking habit
We first wanted to show the association between alcohol-drinking habit and rs671 genotype independent of other factors. Since both age and gender were independent of rs671 genotype (Fig. 2c, j), these two factors could be ignored when the association between alcohol-drinking habit and rs671 genotype was considered. Gamma-GTP level can never be the cause of rs671 genotype and is not likely to be the cause of alcohol-drinking habit, and is also ignored. Therefore, the three factors, age, gender, and gamma-GTP level, could be ignored when we examined the independence of the association between rs671 genotype and alcohol-drinking habit. Since we previously showed the association between rs671 genotype and alcohol-drinking habit (Table 5; Fig. 2e), they are really associated with each other independent of gender and age. Then, we examined the association between rs671 genotype and alcohol-drinking habit in male and female subjects separately. Table 5 indicates that the association between rs671 genotype and alcohol-drinking habit was strong even when data for male and female subjects were analyzed separately. Thus, rs671 genotype was associated with alcohol-drinking habit independent of other factors.
Association between alcohol-drinking habit and gamma-GTP level
We next investigated whether the association between alcohol-drinking habit and gamma-GTP level (Fig. 2f) was independent of other factors by subgroup analysis. Since gender and rs671 genotype were independent of each other (Fig. 2c), the effects of those factors on the association were studied separately. Figure 4b, c indicates that the association between alcohol-drinking habit and gamma-GTP level was replicated in each gender. Next, we examined the association between alcohol-drinking habit and gamma-GTP level within each genotype. Comparison of gamma-GTP levels between the subjects with different alcohol-drinking habits in the group of AA genotype was not possible because of the small sample size. Therefore, we compared gamma-GTP levels between subjects with different alcohol-drinking habits in each of the groups of AG and GG genotypes (Fig. 5). Even among the subjects with the same genotypes (AG or GG), there were significant differences in gamma-GTP levels (Fig. 5a, b). In addition, we examined whether there was an association between alcohol-drinking habit and gamma-GTP level in male subjects with the GG genotype at rs671 (n = 265). Even in such a subgroup with the same gender and the same genotype, there was a difference in gamma-GTP level between different alcohol-drinking habits (Kruskal–Wallis test, P = 1.75 × 10−5, AA = 29, AG = 75, GG = 161). Therefore, the association between alcohol-drinking habit and gamma-GTP level was independent of other factors.
Association between rs671 genotype and gamma-GTP level
The association between gamma-GTP level and rs671 genotype was the first association detected by the association scan. We then examined whether the association between rs671 genotype and gamma-GTP level is independent of other factors. Since gender and age were independent of rs671 genotype (Fig. 2c, j), we examined whether the association was dependent on alcohol-drinking habit to show that the association was independent of other factors. However, since associations between gender and alcohol-drinking habit and between gender and gamma-GTP level were very strong, we first examined the association between rs671 genotype and gamma-GTP level conditional on gender. In male subjects, the gamma-GTP levels between the three genotypes were significantly different as tested by the Kruskal–Wallis test (P = 3.02 × 10−5, Table 2, Fig. 6). When the heterozygotes (AG) were combined with either AA or GG homozygotes, the tests of the differences between two subgroups by Mann–Whitney test showed low P values (AA vs. AG + GG, P = 0.000275; AA + AG vs. GG, P = 0.000146) (Table 2). In general, P values were lower when only male samples were examined than when the entire sample was used (Table 2). The results of the comparison of gamma-GTP levels in different genotypes in only the female samples are shown in Table 2 and Fig. 7. The Kruskal–Wallis test showed a P value of 5.74 × 10−5. Comparison between AA and AG + GG showed a P value of 0.3446, whereas comparison between AA + AG and GG showed a P value of 1.05 × 10−5 (both by Mann–Whitney test).
Then the association between rs671 genotype and gamma-GTP level was examined conditional on alcohol-drinking habit. When the subjects were limited to nondrinkers (n = 166), the subjects with the AA genotype had higher gamma-GTP levels than those with AG or GG (Table 8, P = 0.02, Mann–Whitney test). When the same association was examined using all subjects, the presence of the G allele was positively associated with gamma-GTP level (Table 2; Fig. 1). The apparently paradoxical association between the rs671 genotype and gamma-GTP level disappeared when the analysis was performed in male and female subjects separately (Table 8). P values obtained were all above 0.05 in male and female subjects (Table 8). The apparently paradoxical association between rs671 genotype and gamma-GTP level observed in the entire sample is explained by having both genders in the analysis. Thus, the proportion of AA genotype among nondrinker female subjects was 14/91 = 0.154, whereas that among nondrinker male subjects was 28/75 = 0.373 (P = 0.00022, chi-square test). This is because the proportion of nondrinkers among subjects with AG or GG genotype was higher in female (77/278 = 0.277) than in male subjects (47/404 = 0.113) (P = 1.57 × 10−7, chi-square test). Since gamma-GTP levels were higher in male than in female subjects (Fig. 3), even when the analyses were performed in nondrinkers (Table 8), the subjects with the AA genotype paradoxically showed higher gamma-GTP levels than the other genotypes when the analysis was done in nondrinkers (male and female subjects combined). The results of the analyses of the associations between rs671 genotypes and gamma-GTP levels in the subjects with a nondrinking habit suggest that the association between rs671 genotype and gamma-GTP level is dependent on alcohol-drinking habit. Therefore, the association between rs671 genotype and gamma-GTP level is not independent of other factors, and is an apparent association.
Association between gender and gamma-GTP level
Then we examined whether the association between gender and gamma-GTP level (Fig. 2b) was dependent on other factors. Gamma-GTP levels were significantly higher in male than in female subjects among nondrinkers (P = 1.10 × 10−12, Mann–Whitney U test, Fig. 3b), periodic drinkers (P < 2.22 × 10−12, Mann–Whitney U test, Fig. 3c), and everyday drinkers (P < 2.22 × 10−12, Mann–Whitney U test, Fig. 3d). These results indicate that gamma-GTP level is associated with gender independent of the alcohol-drinking habit. To show that the association between gamma-GTP level and gender is independent of other factors, the effect of age on the association should be examined. However, the test of correlation between age and gamma-GTP level in male and female subjects separately (but not stratified with the alcohol-drinking habit) has shown that the association was not significant either in male (P = 0.058, Spearman’s rank correlation test, r = 0.221) or in female subjects (P = 0.213, Spearman’s rank correlation test, r = 0.133). Therefore, the association between gender and gamma-GTP level is not likely to depend greatly on age.
Association between age and gamma-GTP level
We examined whether the association between age and gamma-GTP level was independent of other factors. Figure 8 shows the relationship between age and log gamma-GTP level in male subjects who were nondrinkers (Fig. 8a), periodic drinkers (Fig. 8b), or everyday drinkers (Fig. 8c). There were significant positive correlations between age and log gamma-GTP level in both male periodic drinkers (Fig. 8b, P = 4.7 × 10−5) and male everyday drinkers (Fig. 8c, P = 5.0 × 10−5). Although there was a nonsignificant correlation between age and gamma-GTP level in male nondrinkers (Fig. 8a), it may reflect a small sample size (n = 75). When the associations were tested in nondrinkers (Fig. 9a, P = 0.021), periodic drinkers (Fig. 9b, P = 6.75 × 10−7), and everyday drinkers (Fig. 9c, P = 0.0015) separately among female subjects, there were significant positive correlations.
Therefore, gamma-GTP level was positively correlated with age independent of gender and alcohol-drinking habit.
Discussion
A scan of 4,190 SNPs in 199 different genes using 38 different quantitative traits to search for associations between genotypes and phenotypes detected an association between rs671 genotypes and gamma-GTP levels. P values, however, were far above the significance level after Bonferroni’s correction. Since the association between ALHD2 genotypes and gamma-GTP levels has already been reported (Takagi et al. 2001) and is supported by functional evidence (Impraim et al. 1982), the association is likely to be true. The results of this study indicate how difficult it is to detect associations from thousands of SNPs and many quantitative traits considering the multiple-comparison problem. However, replication studies may successfully identify new associations between genotypes and phenotypes.
Even if we confirm the association between an SNP and a phenotype, it is not necessarily a direct association. However, although difficult, it is important to decide which association is direct and which is not. Theoretical consideration is sometimes useful; however, careful statistical analyses of the obtained data may help deepen the understanding of the associations.
We first examined the associations between pairs of five factors: rs671 genotype, alcohol-drinking habit, gender, gamma-GTP level, and age. Surprisingly, statistical associations were observed in all pairs of five factors except for the pair of rs671 genotype and gender and rs671 genotype and age (Fig. 2). It is unlikely, however, that all apparent associations between the pairs of factors reflect direct associations.
Our logistic regression analyses, by making the alcohol-drinking habit the criterion variable and rs671 genotype, gender, and age the explanatory variables, have shown that all three explanatory variables are associated with the alcohol-drinking habit. Linear multiple regression analyses, by making gamma-GTP level as the criterion variable and age, gender, alcohol-drinking habit, and rs671 genotype as explanatory variables, have indicated that all explanatory variables are associated with gamma-GTP level. However, more precise analyses were necessary to examine whether the associations are independent of other factors.
Subgroup analysis was very useful to examine whether an association of a pair of two variables was independent of the factors used for the stratification. In this way, the associations between rs671 genotype and alcohol-drinking habit, alcohol-drinking habit and gamma-GTP level, gender and gamma-GTP, and age and gamma-GTP were judged independent of other factors. Conversely, the association between rs671 genotype and gamma-GTP level was considered dependent on the alcohol-drinking habit. Although the direct associations between rs671 genotype and alcohol-drinking habit and between alcohol-drinking habit and gamma-GTP level are supported by functional evidence and previous reports, the associations between gamma-GTP level and gender and gamma-GTP level and age are of interest. The associations between gamma-GTP and age (Puukka et al. 2006) and gamma-GTP and gender (Stewart et al. 2007) have been previously reported.
The associations between genotypes and multiple phenotypes are often very complicated. Previous studies have shown the association between ALDH2 genotypes and hypertension (Tsuritani et al. 1995). In addition, gamma-GTP levels have been reported to be associated with hypertension (Takagi et al. 2001; Shankar and Li 2007), cardiovascular event (Fraser et al. 2007b; Patel et al. 2007; Yang et al. 2007; Hozawa et al. 2007; Turgut et al. 2006; Lee et al. 2006), type 2 diabetes (Doi et al. 2007; Fraser et al. 2007a; Lim et al. 2007), obesity (Lim et al. 2007), adiponectin (Kim et al. 2007), hyperlipidemia (Lippi et al. 2007), and metabolic syndrome (Andre et al. 2007; Lee et al. 2007). Although most of these associations are thought to be indirect and dependent on other factors such as alcohol-drinking habit, careful analyses of the data from the statistical viewpoint are necessary.
Associations between genetic and environmental factors will become a focus of medical and biological studies, since recent genome-wide association studies are expected to identify many association candidates (Ozaki et al. 2002; Suzuki et al. 2003; Kochi et al. 2005; The Wellcome Trust Case Control Consortium 2007). Our study reported here shows that (1) a large sample size combined with a replication study is necessary to overcome the multiple-comparison problem, and (2) subgroup analysis along with logistic and linear multiple regression analysis may be useful to dissect the complicated relationship.
References
Andre P, Balkau B, Vol S, Charles MA, Eschwege E (2007) Gamma-glutamyltransferase activity and development of the metabolic syndrome (International Diabetes Federation Definition) in middle-aged men and women: Data from the Epidemiological Study on the Insulin Resistance Syndrome (DESIR) cohort. Diabetes Care 30:2355–2361
Boone DJ, Tietz NW, Weinstock A (1977) Significance of gamma-glutamyl transferase (GGT) activity measurements in alcohol-induced hepatic injury. Ann Clin Lab Sci 7:25–28
Chehab FF, Johnson J, Louie E, Goossens M, Kawasaki E, Erlich H (1991) A dimorphic 4-bp repeat in the cystic fibrosis gene is in absolute linkage disequilibrium with the delta F508 mutation: implications for prenatal diagnosis and mutation origin. Am J Hum Genet 48:223–226
Crabb DW, Edenberg HJ, Bosron WF, Li TK (1989) Genotypes for aldehyde dehydrogenase deficiency and alcohol sensitivity. The inactive ALDH2(2) allele is dominant. J Clin Invest 83:314–316
Doi Y, Kubo M, Yonemoto K, Ninomiya T, Iwase M, Tanizaki Y, Shikata K, Iida M, Kiyohara Y (2007) Liver enzymes as a predictor for incident diabetes in a Japanese population: the Hisayama study. Obesity 15:1841–1850
Feldman MW, Franklin I, Thomson GJ (1974) Selection in complex genetic systems. I. The symmetric equilibria of the three-locus symmetric viability model. Genetics 76:135–162
Fraser A, Ebrahim S, Smith GD, Lawlor DA (2007a) A comparison of associations of alanine aminotransferase and gamma-glutamyltransferase with fasting glucose, fasting insulin, and glycated hemoglobin in women with and without diabetes. Hepatology 46:158–165
Fraser A, Harris R, Sattar N, Ebrahim S, Smith GD, Lawlor DA (2007b) Gamma-glutamyltransferase is associated with incident vascular events independently of alcohol intake: analysis of the British Women’s Heart and Health Study and Meta-analysis. Arterioscler Thromb Vasc Biol 27:2729–2735
Goedde HW, Agarwal DP, Harada S, Rothhammer F, Whittaker JO, Lisker R (1986) Aldehyde dehydrogenase polymorphism in North American, South American, and Mexican Indian populations. Am J Hum Genet 38:395–399
Goedde HW, Agarwal DP, Fritze G, Meier-Tackmann D, Singh S, Beckmann G, Bhatia K, Chen LZ, Fang B, Lisker R et al (1992) Distribution of ADH2 and ALDH2 genotypes in different populations. Hum Genet 88:344–346
Hozawa A, Okamura T, Kadowaki T, Murakami Y, Nakamura K, Hayakawa T, Kita Y, Nakamura Y, Okayama A, Ueshima H (2007) NIPPON DATA90 research group. Gamma-Glutamyltransferase predicts cardiovascular death among Japanese women. Atherosclerosis 194:498–504
Impraim C, Wang G, Yoshida A (1982) Structural mutation in a major human aldehyde dehydrogenase gene results in loss of enzyme activity. Am J Hum Genet 34:837–841
Ito T, Inoue E, Kamatani N (2004) Association test algorithm between a qualitative phenotype and a haplotype or haplotype set using simultaneous estimation of haplotype frequencies, diplotype configurations and diplotype-based penetrances. Genetics 168: 2339–2348
Kamatani N, Sekine A, Kitamoto T, Iida A, Saito S, Kogame A, Inoue E, Kawamoto M, Harigai M, Nakamura Y (2004) Large-scale single-nucleotide polymorphism (SNP) and haplotype analyses, using dense SNP Maps, of 199 drug-related genes in 752 subjects: the analysis of the association between uncommon SNPs within haplotype blocks and the haplotypes constructed with haplotype-tagging SNPs. Am J Hum Genet 75:190–203
Kasperczyk A, DiMartino NA, Krontiris TG (1990) Minisatellite allele diversification: the origin of rare alleles at the HRAS1 locus. Am J Hum Genet 47:854–859
Kim JA, Kim SM, Yoon D (2007) The relationship between gamma-glutamyltransferase and adiponectin in nonalcoholic women. Metabolism 56:578–582
Kochi Y et al (2005) A functional variant in FCRL3, encoding Fc receptor-like 3, is associated with rheumatoid arthritis and several autoimmunities. Nat Genet 37:478–485
Lee DH, Silventoinen K, Hu G, Jacobs DR Jr, Jousilahti P, Sundvall J,Tuomilehto J (2006) Serum gamma-glutamyltransferase predicts non-fatal myocardial infarction and fatal coronary heart disease among 28,838 middle-aged men and women. Eur Heart J 27:2170–2176
Lee DS, Evans JC, Robins SJ, Wilson PW, Albano I, Fox CS, Wang TJ, Benjamin EJ, D’Agostino RB, Vasan RS (2007) Gamma glutamyl transferase and metabolic syndrome, cardiovascular disease, and mortality risk: the Framingham heart study. Arterioscler Thromb Vasc Biol 27:127–133
Lim JS, Lee DH, Park JY, Jin SH, Jacobs DR Jr (2007) A strong interaction between serum gamma-glutamyltransferase and obesity on the risk of prevalent type 2 diabetes: results from the third national health and nutrition examination survey. Clin Chem 53:1092–1098
Lippi G, Targher G, Montagnana M, Salvagno GL, Guidi GC (2007) Relationship between gamma-glutamyltransferase, lipids and lipoprotein(a) in the general population. Clin Chim Acta 384:163–166
Ohnishi Y, Tanaka T, Ozaki K, Yamada R, Suzuki H, Nakamura Y (2001) A high-throughput SNP typing system for genome-wide association studies. J Hum Genet 46: 471–477
Ozaki K et al (2002) Functional SNPs in the lymphotoxin-alpha gene that are associated with susceptibility to myocardial infarction. Nat Genet 32:650–654
Patel DA, Srinivasan SR, Xu JH, Chen W, Berenson GS (2007) Persistent elevation of liver function enzymes within the reference range is associated with increased cardiovascular risk in young adults: the Bogalusa heart study. Metabolism 56:792–798
Puukka K, Hietala J, Koivisto H, Anttila P, Bloigu R, Niemela O (2006) Age-related changes on serum ggt activity and the assessment of ethanol intake. Alcohol Alcohol 41:522–527
Rosalki SB (1973) Letter: screening test for alcoholism. Lancet 7833:843
Rosalki SB (1974) Letter: gamma-glutamyl transpeptidase in alcoholism. Am J Clin Pathol 62:579–580
Shankar A, Li J (2007) Association between serum gamma-glutamyltransferase level and prehypertension among US adults. Circ J 71:1567–1572
Shibata K, Ito T, Kitamura Y, Iwasaki N, Tanaka H, Kamatani N (2004) Simultaneous estimation of haplotype frequencies and quantitative trait parameters: applications to the test of association between phenotype and diplotype configuration. Genetics 168:525–539
Shibuya A, Yasunami M, Yoshida A (1989) Genotype of alcohol dehydrogenase and aldehyde dehydrogenase loci in Japanese alcohol flushers and nonflushers. Hum Genet 82:14–16
Spencer-Peet J, Wood DC, Glatt MM, Wiseman SM (1975) Urinary d-glucaric acid excretion and serum gamma-glutamyl transpeptidase activity in alcoholism. Br J Addict Alcohol Other Drugs 70:359–364
Stewart SH, Connors GJ, Hutson A (2007) Ethnicity and gamma-glutamyltransferase in men and women with alcohol use disorders. Alcohol Alcohol 42:24–27
Suzuki A et al (2003) Functional haplotypes of PADI4, encoding citrullinating enzyme peptidylarginine deiminase 4, are associated with rheumatoid arthritis. Nat Genet 34:395–402
Takagi S, Baba S, Iwai N, Fukuda M, Katsuya T, Higaki J, Mannami T, Ogata J, Goto Y, Ogihara T (2001) The aldehyde dehydrogenase 2 gene is a risk factor for hypertension in Japanese but does not alter the sensitivity to pressor effects of alcohol: the Suita study. Hypertens Res 24:365–370
Teschke R, Brand A, Strohmeyer G (1977) Induction of hepatic microsomal gamma-glutamyltransferase activity following chronic alcohol consumption. Biochem Biophys Res Commun 175:718–724
The Wellcome Trust Case Control Consortium (2007) Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature 447:661–678
Tsuritani I, Ikai E, Date T, Suzuki Y, Ishizaki M, Yamada Y (1995) Polymorphism in ALDH2-genotype in Japanese men and the alcohol-blood pressure relationship. Am J Hypertens 8:1053–1059
Turgut O, Yilmaz A, Yalta K, Karadas F, Birhan Yilmaz M (2006) gamma-Glutamyltransferase is a promising biomarker for cardiovascular risk. Med Hypotheses 67:1060–1064
Walter MA, Cox DW (1988) Analysis of genetic variation reveals human immunoglobulin VH-region gene organization. Am J Hum Genet 42:446–451
Yang JJ, Shiwaku K, Nabika T, Masuda J, Kobayashi S (2007) High frequency of cardiovascular risk factors in overweight adult Japanese subjects. Arch Med Res 38:337–344
Acknowledgments
This research was supported by the Health and Labour Sciences Research Grants from Research on Human Genome tailor-made by the Ministry of Health, Labour and Welfare of Japan. We thank Ms. Aki Nakano for editorial assistance.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Nose, J., Saito, A. & Kamatani, N. Statistical analysis of the associations between polymorphisms within aldehyde dehydrogenase 2 (ALDH2), and quantitative and qualitative traits extracted from a large-scale database of Japanese single-nucleotide polymorphisms (SNPs). J Hum Genet 53, 425–438 (2008). https://doi.org/10.1007/s10038-008-0266-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10038-008-0266-2
Keywords
This article is cited by
-
Association between aldehyde dehydrogenase 2 polymorphisms and the incidence of diabetic retinopathy among Japanese subjects with type 2 diabetes mellitus
Cardiovascular Diabetology (2013)
-
Genome-wide association study of hematological and biochemical traits in a Japanese population
Nature Genetics (2010)