Introduction

Due to the availability of accurate genotyping systems for thousands or millions of single-nucleotide polymorphisms (SNPs), genetic association studies are now a focus of medical research (Kamatani et al. 2004; Rosalki 1973, 1974; Boone et al. 1977; Spencer-Peet et al. 1975; Teschke et al. 1977; Impraim et al. 1982). One problem in using thousands of SNPs is multiple comparisons. Thus, if the test is performed using thousands of SNPs, a certain number of SNPs show very low P values, even if they are not truly associated with the traits. To overcome the problem of multiple testing, such methods as Bonferroni’s correction are used; however, in this case, extremely low P values are required for SNPs to be truly associated with the traits.

Another problem in association studies for multifactorial diseases is relatively small effect sizes. Unlike Mendelian traits, the effects of genotypes on phenotypes are not large in such diseases. To cope with the problem of small effect size as well as multiple comparisons, a large sample size is necessary.

An additional issue to consider in the study of multifactorial traits is the effect of nongenetic factors. For example, gender, age, habit, diet, and exposure to chemicals may be associated with the development of phenotypes. In some cases, such nongenetic factors may affect each other or may interact with genetic factors for the development of phenotypes leading to a very complicated structure of interactions. In this investigation, we show such an example and a potential method to dissect the complicated structure of the associations.

Enzymes and transporters possibly related to transportation, metabolism, and action of chemical agents may be associated with either efficacy or adverse drug interactions. In a previous study, we constructed a database of SNPs and haplotypes for 4,190 SNPs in 199 different genes using blood samples from 752 Japanese subjects (Kamatani et al. 2004). For those subjects, 38 different quantitative traits were available. Some of these traits may be associated with either genotypes or diplotype configurations (combination of two haplotypes) in some genes independently or with interactions. We performed an extensive analysis to search for such associations using all available genomic variations and traits. Through such an analysis, we observed a strong association between SNPs of aldehyde dehydrogenase 2 (ALDH2) and serum gamma-glutamyltranspeptidase (gamma-GTP) levels. Serum gamma-GTP elevates after alcohol ingestion (Rosalki 1973, 1974; Boone et al. 1977; Spencer-Peet et al. 1975; Teschke et al. 1977), and a polymorphism (rs671) in ALDH2 is associated with individual tolerance to alcohol (Impraim et al. 1982; Goedde et al. 1986, 1992; Crabb et al. 1989; Shibuya et al. 1989). In fact, gamma-GTP levels were associated with alcohol-drinking habit and gender in addition to the rs671 SNP genotype in our data. In this study, we analyzed data to dissect the complicated structure of associations between genomic variations in ALDH2, alcohol-drinking habit, gender, age, and serum gamma-GTP levels to establish an effective method to investigate associations of genomic variations with both qualitative and quantitative traits.

Materials and methods

Subjects

New samples were not collected; data collected in a previous study (Kamatani et al. 2004) were used. From that study, genotype data of 4,190 SNPs in 199 different genes from 752 Japanese subjects were available. Those SNPs had been found from genes coding for enzymes and transporters possibly related to transportation, metabolism, and action of chemical agents (Kamatani et al. 2004). SNPs within the same genes are often in linkage disequilibrium (LD) with each other, and the results of the LD analysis have been described elsewhere (Kamatani et al. 2004). For those subjects, 38 different quantitative traits including serum gamma-GTP levels were obtained. Informed consents had been obtained from all subjects for using their DNA for any association studies, and the study had been approved by the institutional ethical committee.

Genotyping

Genotyping was performed by the Invader system (Ohnishi et al. 2001). Methods for the genotype data collection have been described previously (Kamatani et al. 2004).

Scan of the associations between SNPs and quantitative phenotypes

The computer program to scan all 4,190 SNPs and 38 different quantitative traits to find associations was made with C language. All other calculations and statistical analyses were performed in R.

SNP information

SNP information such as order and map position was obtained from the National Center for Biotechnology Information (NCBI) database (http://www.ncbi.nlm.nih.gov/). LD parameters D′ and r 2 were calculated using PENHAPLO and QTLHAPLO programs (Ito et al. 2004; Shibata et al. 2004).

Statistical analysis

Association between genotypes and phenotypes was performed by the single-locus analysis only. No haplotype-based association tests were performed. Since most of the 38 quantitative variables were not normally distributed, nonparametric methods were used. To compare quantitative variables in two different groups, the Mann–Whitney U test was used. To compare the same variables in more than two different groups, the Kruskal–Wallis test was used. To test the correlation between two quantitative variables, Spearman’s rank correlation coefficient was used. When the correlation between age and gamma-GTP level was examined, the parametric method after the logarithmic conversion of gamma-GTP level was used. All statistical tests and inferences, including logistic multiple regression analysis and linear multiple regression analysis, were performed in the R environment.

Results

Scan of associations between genotypes and quantitative traits

The test of association using each of the 4,190 SNPs and each of the 38 different quantitative traits was performed using the Mann–Whitneys test. For each SNP, two different comparisons were made: between one homozygote type and the rest of the subjects and between the other homozygote type and the rest of the subjects. When the SNP consisted of A and G alleles, for example, the subjects with the AA genotype were compared with those with the AG or GG genotype for one comparison, whereas the subjects with GG were compared with those with the AA or AG genotype for the other.

It was found that P values were among the lowest for the association between rs4646776 and rs671 SNPs within ALDH2 and serum gamma-GTP levels (Table 1). For rs671, the subjects with the GG genotype [n = 436, median 24.5, mean 39.8, standard deviation (SD) 42.0] had significantly higher gamma-GTP levels than those with the AA or AG genotype (n = 314, median 21.0, mean 31.2, SD 31.4) (P = 0.000299, Mann–Whitneys U test) (Table 2). The comparison between subjects with the AA genotype and those with the AG or GG genotype (AG + GG) at the same SNP showed a higher P value of 0.0493 (Table 2).

Table 1 Single-nucleotide polymorphisms (SNPs) in ALDH2 gene and linkage disequilibrium (LD) parameters
Table 2 Comparison of gamma-glutamyltranspeptidase (gamma-GTP) levels between subjects with different genotypes at rs671

Linkage disequilibrium analysis of rs671 SNP

Closely located SNPs are often in strong linkage disequilibrium with each other. We determined the LD parameters between pairs of nine SNPs including rs671, which is the critical SNP for alcohol intolerance (Impraim et al. 1982; Goedde et al. 1986, 1992; Crabb et al. 1989; Shibuya et al. 1989). Table 1 shows that D′ values between rs671 and each of the other eight SNPs were either 1.0 or very close to 1, thereby indicating that rs671 was in almost complete LD (Feldman et al. 1974) with these SNPs. On the other hand, r 2 values between rs671 and rs4646776 but not between the other SNPs were close to 1.0. These data indicate that rs671 is almost in absolute LD (Walter and Cox 1988; Kasperczyk et al. 1990; Chehab et al. 1991) with rs4646776 but not with the other SNPs. The physical distance between rs671 and rs4646776 is about 11.7 kb, and it is not surprising that the two SNPs in this distance are in absolute LD.

If two SNPs are in absolute LD, it is expected that one of them can represent the other almost completely. As expected, rs4646776 but not the other SNPs was strongly associated with gamma-GTP level (Table 1). When gamma-GTP levels were compared between different genotypes at rs4646776, the difference was statistically significant by the Kruskal–Wallis test (P = 0.00105) (Table 1). When gamma-GTP levels were compared between the genotypes CC + CG and GG, they were significantly different (P = 0.000321, Mann–Whitney test).

Association between the genotype at rs671 and gamma-GTP level

SNP rs671 has been known to affect ALDH2 enzyme function (Impraim et al. 1982). Subjects with the AG genotype have a higher sensitivity to alcohol than those with the GG genotype, whereas those with the AA genotype are even more sensitive than those with the AG genotype (Impraim et al. 1982; Goedde et al. 1986, 1992; Crabb et al. 1989; Shibuya et al. 1989).

Although the P value of 0.000299 is quite low compared with the regular significance level of 0.05, it is much higher than the significance level required after Bonferroni’s correction for multiple comparisons. Since the test was performed for 4,190 SNPs and 38 quantitative traits with two different modes, the significance level required after Bonferroni’s correction will be 0.05/(4,190 × 38 × 2) = 1.57 × 10−7.

Next, gamma-GTP levels were compared between three different genotypes at rs671 SNP. Information was available for 750 subjects. Comparison between the gamma-GTP levels of the subjects with AA (n = 50), AG (n = 264), and GG (n = 436) genotypes at rs671 are shown by box plots in Fig. 1. Medians, means, and standard deviations (SD) of gamma-GTP levels with the three different genotypes as well as the combined genotypes (AA + AG and AG + GG) are shown in Table 2. The gamma-GTP levels in the three genotypes AA, AG, and GG were significantly different when tested by the Kruskal–Wallis test (P = 0.000988, Table 2), but the P value was still insufficient to reach the threshold of significance after Bonferroni’s correction (3.14 × 10−7). Data in Table 2 and Fig. 1 indicate that there is a gene dosage effect in the association between the rs671 SNP and gamma-GTP levels; i.e., this quantitative trait increases with an increasing dose of the G allele.

Fig. 1
figure 1

Comparison of gamma-glutamyltranspeptidase (gamma-GTP) levels between subjects with AA, AG, and =GG genotypes at rs671 are shown as box plots. Male and female subjects were combined as a single sample

Association between each pair of five factors

Although the association between the rs671 genotype and gamma-GTP level was extracted from the association study in which multiple SNPs and phenotypes were included, the precise relationship between various factors should be analyzed with care. Previous studies have indicated the cause-and-effect relationship between rs671 genotype and alcohol-drinking habit (Impraim et al. 1982) and between alcohol-drinking habit and gamma-GTP level (Boone et al. 1977). However, other factors, including gender and age, are also likely to be involved.

In this study, we examined the structure of the relationship between two qualitative variables (gender and alcohol-drinking habit), two quantitative variables (age and gamma-GTP level), and the rs671 genotype (Fig. 2). Gender, alcohol-drinking habit, and the rs671 genotype can be interpreted as discrete random variables, whereas both age and gamma-GTP level can be interpreted as continuous random variables. Neither gamma-GTP level nor age is normally distributed, and the application of parametric methods is invalid. We first examined the association between each pair of five factors in Fig. 2. We had already obtained evidence of the association between the rs671 genotype (i.e., number of G alleles in the genotype) and gamma-GTP level (Figs. 1, 2a; Table 2) by nonparametric methods.

Fig. 2
figure 2

Relationships between pairs of five different factors: rs671 genotype, gamma-glutamyltranspeptidase (gamma-GTP) level, age, gender, and alcohol-drinking habit. Since there are ten different pairs of factors, we can consider ten different relationships. Except for the relationships between rs671 genotype and gender (c), and age and rs671 genotype (j), all eight other relationships were shown to be dependent. Dependent and independent relationships are expressed by closed and dotted lines, respectively. Relationship between rs671 genotype and gamma-GTP level (a) was apparently dependent but judged to be independent by further analysis

We next examined the association between gender and gamma-GTP level (Fig. 2b). Male subjects had significantly higher gamma-GTP levels than female subjects (Fig. 3a) (P < 2.2 × 10−16, Mann–Whitney test). We then examined the association between gender and rs671 genotype (Fig. 2c). Table 3 shows the number of genotypes in male and female subjects. There was no association between gender and rs671 genotype (chi-square test with 2 df, P = 0.202). Therefore, as expected from inheritance laws, the rs671 genotype was independent of gender (Fig. 2c).

Fig. 3
figure 3

a Comparison of gamma-glutamyltranspeptidase (gamma-GTP) levels between male and female subjects. b Comparison of gamma-GTP levels between male and female subjects in nondrinkers. c Comparison of gamma-GTP levels between male and female subjects in periodic drinkers. d Comparison of gamma-GTP levels between male and female subjects in everyday drinkers

Table 3 Numbers of male and female subjects with different genotypes at rs671

We then examined the association between gender and alcohol-drinking habit (Fig. 2d). The subjects were categorized based on their answers to the question “Do you drink alcohol”? The three possible answers were 0 “no”; 1 “yes, periodically”; and 2 “yes, almost every day”. The question was answered by 734 subjects. Table 4 shows the subjects categorized by gender and their answers to the question. Gender was strongly associated with alcohol-drinking habit (Table 4) (P = 1.83 × 10−12, chi-square test with 2 df). Males were categorized as everyday drinkers more frequently than were female subjects [risk ratio (RR) 2.03, 95% confidence interval (CI) 1.64–2.50].

Table 4 Numbers of male and female subjects with different alcohol-drinking habits

We then examined the association between alcohol-drinking habit and the rs671 genotype (Fig. 2e). The values for both variables were available from 732 subjects. Table 5 shows the association between the habit of alcohol ingestion and rs671 genotypes by a 3 × 3 contingency table. The proportions of subjects who drank alcohol (periodically or almost everyday) increased significantly with increasing numbers of G allele in the genotype (Table 5) (P < 2.2 × 10−16, chi-square test with 2 df). When the risk of everyday drinking was compared between different rs671 genotypes, the probability of becoming everyday drinkers was 4.69-fold (RR 95% CI 2.47–8.88, P = 1.22 × 10−15, chi-square test with 1 df) as high for subjects with the AG genotype as those with the AA genotype, whereas it was 5.36-fold (RR 95%CI 2.84–10.1, P = 5.18 × 10−30, chi-square test with 1 df) as high as those with the GG genotype (calculated from Table 5).

Table 5 Numbers of subjects with different alcohol-drinking habits and different genotypes at rs671

Then, we examined the association between alcohol-drinking habit and gamma-GTP level (Fig. 2f) from 734 subjects. Gamma-GTP levels were much higher in everyday drinkers than in periodic drinkers, and they were much higher in periodic drinkers than in nondrinkers (Fig. 4a) (P < 2.2 × 10−16, Kruskal–Wallis test). These results indicate that the alcohol-drinking habit was strongly associated with gamma-GTP level (Fig. 2f).

Fig. 4
figure 4

Comparison of gamma-glutamyltranspeptidase (gamma-GTP) levels between subjects with different alcohol-drinking habits. ND nondrinker, PD periodic drinker, ED almost everyday drinker. a Male and female subjects were combined, or b male and c female subjects were analyzed separately

We then examined the association between age and alcohol-drinking habit. Ages of everyday drinkers (mean 42.1, SD 11.1, n = 312) were higher than those of nondrinkers (mean 37.4, SD 11.8, n = 166, P = 2.80 × 10−6, Mann–Whitney U test) and those of periodic drinkers (mean 35.6, SD 10.8, n = 256, P = 6.55 × 10−13, Mann–Whitney U test) (Kruskal–Wallis test, P = 6.66 × 10−13). These results indicate that age is clearly associated with alcohol-drinking habit (Fig. 2g). In addition, age was positively correlated with gamma-GTP levels (Spearman’s rank correlation test, P < 2.2 × 10−16, r = 0.408, n = 752), thereby showing that age is associated with gamma-GTP level (Fig. 2h). The ages of male subjects (n = 449, mean 40.5, SD 11.4) were significantly (Mann–Whitney U test, P = 1.22 × 10−9) higher than female subjects (n = 303, mean 36.1, SD 11.5). Therefore, age and gender are associated with each other in our study (Fig. 2i). Then, we compared the ages between different rs671 genotypes. The ages in the three different genotypes were not significantly different (Kruskal–Wallis test, P = 0.5), thereby indicating that age and rs671 genotype are not associated with each other (Fig. 2j).

In summary, we have shown clear associations between all pairs of five factors in Fig. 2 except for rs671 genotype and gender and rs671 genotype and age. However, the structure of the relationship in Fig. 2 is very complicated and needs further investigation.

Multiple regression analysis

The first step to study the effects of various factors on a variable is multiple regression analysis. First, logistic multiple regression analysis was performed by setting alcohol-drinking habit as the criterion variable. In this binary-type analysis, alcohol-drinking habit was categorized as 0 (nondrinker or periodic drinker) or 1 (everyday drinker). The explanatory variables were age, gender, and rs671 genotype. Gamma-GTP level was not used because it does not likely affect the alcohol-drinking habit. Gender was coded as 1 (male) and 2 (female), and rs671 genotype was coded as -1 (AA), 0 (AG) and 1 (GG). Note that the explanatory variables age, gender, and rs671 genotype are theoretically independent from each other, although there was an association between age and gender (Fig. 2). Table 6 shows the results of the logistic multiple regression analysis. As shown, elder age, male gender, and G allele at rs671 significantly increase the probability of everyday drinking. P values were the lowest for the rs671 genotype, followed by gender and age. These results were as expected and the interpretation was straightforward.

Table 6 Logistic regression analysis for alcohol-drinking habit

In the next step, data was analyzed by linear multiple regression analysis with log gamma-GTP level as the criterion variable. In fact, log gamma-GTP level does not necessarily fit a normal distribution; however, it is much better than raw gamma-GTP level (data not shown). Explanatory variables were age, gender, alcohol-drinking habit, and rs671 genotype. Gender was coded as 1 (male) and 2 (female), whereas alcohol-drinking habit was coded as nondrinker (0), periodic drinker (1), and everyday drinker (2). The rs671 genotype was coded as −1 (AA), 0 (AG), and 1 (GG). There is a limitation in this analysis because explanatory variables are dependent on each other. Table 7 shows the results of this analysis. Although the results suggest that gender, age, alcohol-drinking habit, and rs671 genotype are associated with gamma-GTP level, the interpretation is not easy, because explanatory variables are associated with each other in a complicated manner (Fig. 2).

Table 7 Linear multiple regression analysis

Subgroup analysis to examine the dependence of the association on other factors

Although investigation of the relationship between various factors as in Fig. 2 is not easy, a straightforward method is to examine the association in a rather homogeneous subgroup. One may stratify the entire sample into subgroups according to categorical variables until the subgroup populations become rather homogenous. However, the problem is that one cannot classify the sample according to a continuous variable and that the final subgroups may be too small for statistical analysis. By taking advantage of the large size of our sample, we attempted to perform association studies using subdivided samples. There were three categorical variables (gender, alcohol-drinking habit, and rs671 genotype). If we used all three categorical variables, we could divide the entire sample into 18 subgroups. However, the test of association using each of the 18 subgroups was not realistic. We may be able to ignore the effect of some factors because of the evidence of independence or very little dependence.

Association between rs671 genotype and alcohol-drinking habit

We first wanted to show the association between alcohol-drinking habit and rs671 genotype independent of other factors. Since both age and gender were independent of rs671 genotype (Fig. 2c, j), these two factors could be ignored when the association between alcohol-drinking habit and rs671 genotype was considered. Gamma-GTP level can never be the cause of rs671 genotype and is not likely to be the cause of alcohol-drinking habit, and is also ignored. Therefore, the three factors, age, gender, and gamma-GTP level, could be ignored when we examined the independence of the association between rs671 genotype and alcohol-drinking habit. Since we previously showed the association between rs671 genotype and alcohol-drinking habit (Table 5; Fig. 2e), they are really associated with each other independent of gender and age. Then, we examined the association between rs671 genotype and alcohol-drinking habit in male and female subjects separately. Table 5 indicates that the association between rs671 genotype and alcohol-drinking habit was strong even when data for male and female subjects were analyzed separately. Thus, rs671 genotype was associated with alcohol-drinking habit independent of other factors.

Association between alcohol-drinking habit and gamma-GTP level

We next investigated whether the association between alcohol-drinking habit and gamma-GTP level (Fig. 2f) was independent of other factors by subgroup analysis. Since gender and rs671 genotype were independent of each other (Fig. 2c), the effects of those factors on the association were studied separately. Figure 4b, c indicates that the association between alcohol-drinking habit and gamma-GTP level was replicated in each gender. Next, we examined the association between alcohol-drinking habit and gamma-GTP level within each genotype. Comparison of gamma-GTP levels between the subjects with different alcohol-drinking habits in the group of AA genotype was not possible because of the small sample size. Therefore, we compared gamma-GTP levels between subjects with different alcohol-drinking habits in each of the groups of AG and GG genotypes (Fig. 5). Even among the subjects with the same genotypes (AG or GG), there were significant differences in gamma-GTP levels (Fig. 5a, b). In addition, we examined whether there was an association between alcohol-drinking habit and gamma-GTP level in male subjects with the GG genotype at rs671 (n = 265). Even in such a subgroup with the same gender and the same genotype, there was a difference in gamma-GTP level between different alcohol-drinking habits (Kruskal–Wallis test, P = 1.75 × 10−5, AA = 29, AG = 75, GG = 161). Therefore, the association between alcohol-drinking habit and gamma-GTP level was independent of other factors.

Fig. 5
figure 5

Comparison of gamma-glutamyltranspeptidase (gamma-GTP) levels between subjects with different alcohol-drinking habits in the AG and GG genotypes at rs671. a Data from subjects with the genotype AG were analyzed. b Data from subjects with the genotype GG were analyzed. ND nondrinker, PD periodic drinker, ED almost everyday drinker

Association between rs671 genotype and gamma-GTP level

The association between gamma-GTP level and rs671 genotype was the first association detected by the association scan. We then examined whether the association between rs671 genotype and gamma-GTP level is independent of other factors. Since gender and age were independent of rs671 genotype (Fig. 2c, j), we examined whether the association was dependent on alcohol-drinking habit to show that the association was independent of other factors. However, since associations between gender and alcohol-drinking habit and between gender and gamma-GTP level were very strong, we first examined the association between rs671 genotype and gamma-GTP level conditional on gender. In male subjects, the gamma-GTP levels between the three genotypes were significantly different as tested by the Kruskal–Wallis test (P = 3.02 × 10−5, Table 2, Fig. 6). When the heterozygotes (AG) were combined with either AA or GG homozygotes, the tests of the differences between two subgroups by Mann–Whitney test showed low P values (AA vs. AG + GG, P = 0.000275; AA + AG vs. GG, P = 0.000146) (Table 2). In general, P values were lower when only male samples were examined than when the entire sample was used (Table 2). The results of the comparison of gamma-GTP levels in different genotypes in only the female samples are shown in Table 2 and Fig. 7. The Kruskal–Wallis test showed a P value of 5.74 × 10−5. Comparison between AA and AG + GG showed a P value of 0.3446, whereas comparison between AA + AG and GG showed a P value of 1.05 × 10−5 (both by Mann–Whitney test).

Fig. 6
figure 6

Comparison of gamma-glutamyltranspeptidase (gamma-GTP) levels between different rs671 genotypes in male subjects

Fig. 7
figure 7

Comparison of gamma-glutamyltranspeptidase (gamma-GTP) levels between different genotypes at rs671 in female subjects

Then the association between rs671 genotype and gamma-GTP level was examined conditional on alcohol-drinking habit. When the subjects were limited to nondrinkers (n = 166), the subjects with the AA genotype had higher gamma-GTP levels than those with AG or GG (Table 8, P = 0.02, Mann–Whitney test). When the same association was examined using all subjects, the presence of the G allele was positively associated with gamma-GTP level (Table 2; Fig. 1). The apparently paradoxical association between the rs671 genotype and gamma-GTP level disappeared when the analysis was performed in male and female subjects separately (Table 8). P values obtained were all above 0.05 in male and female subjects (Table 8). The apparently paradoxical association between rs671 genotype and gamma-GTP level observed in the entire sample is explained by having both genders in the analysis. Thus, the proportion of AA genotype among nondrinker female subjects was 14/91 = 0.154, whereas that among nondrinker male subjects was 28/75 = 0.373 (P = 0.00022, chi-square test). This is because the proportion of nondrinkers among subjects with AG or GG genotype was higher in female (77/278 = 0.277) than in male subjects (47/404 = 0.113) (P = 1.57 × 10−7, chi-square test). Since gamma-GTP levels were higher in male than in female subjects (Fig. 3), even when the analyses were performed in nondrinkers (Table 8), the subjects with the AA genotype paradoxically showed higher gamma-GTP levels than the other genotypes when the analysis was done in nondrinkers (male and female subjects combined). The results of the analyses of the associations between rs671 genotypes and gamma-GTP levels in the subjects with a nondrinking habit suggest that the association between rs671 genotype and gamma-GTP level is dependent on alcohol-drinking habit. Therefore, the association between rs671 genotype and gamma-GTP level is not independent of other factors, and is an apparent association.

Table 8 Association between gamma-glutamyltranspeptidase (gamma-GTP) levels and genotypes at rs671 in nondrinkers

Association between gender and gamma-GTP level

Then we examined whether the association between gender and gamma-GTP level (Fig. 2b) was dependent on other factors. Gamma-GTP levels were significantly higher in male than in female subjects among nondrinkers (P = 1.10 × 10−12, Mann–Whitney U test, Fig. 3b), periodic drinkers (P < 2.22 × 10−12, Mann–Whitney U test, Fig. 3c), and everyday drinkers (P < 2.22 × 10−12, Mann–Whitney U test, Fig. 3d). These results indicate that gamma-GTP level is associated with gender independent of the alcohol-drinking habit. To show that the association between gamma-GTP level and gender is independent of other factors, the effect of age on the association should be examined. However, the test of correlation between age and gamma-GTP level in male and female subjects separately (but not stratified with the alcohol-drinking habit) has shown that the association was not significant either in male (P = 0.058, Spearman’s rank correlation test, r = 0.221) or in female subjects (P = 0.213, Spearman’s rank correlation test, r = 0.133). Therefore, the association between gender and gamma-GTP level is not likely to depend greatly on age.

Association between age and gamma-GTP level

We examined whether the association between age and gamma-GTP level was independent of other factors. Figure 8 shows the relationship between age and log gamma-GTP level in male subjects who were nondrinkers (Fig. 8a), periodic drinkers (Fig. 8b), or everyday drinkers (Fig. 8c). There were significant positive correlations between age and log gamma-GTP level in both male periodic drinkers (Fig. 8b, P = 4.7 × 10−5) and male everyday drinkers (Fig. 8c, P = 5.0 × 10−5). Although there was a nonsignificant correlation between age and gamma-GTP level in male nondrinkers (Fig. 8a), it may reflect a small sample size (n = 75). When the associations were tested in nondrinkers (Fig. 9a, P = 0.021), periodic drinkers (Fig. 9b, P = 6.75 × 10−7), and everyday drinkers (Fig. 9c, P = 0.0015) separately among female subjects, there were significant positive correlations.

Fig. 8
figure 8

Correlation between age and log gamma-glutamyltranspeptidase (gamma-GTP) levels in male subjects was analyzed. a The correlation was analyzed only in male nondrinkers. b The correlation was analyzed only in male periodic drinkers. c The correlation was analyzed only in male everyday drinkers. The test of correlation was performed with the null hypothesis of correlation coefficient (r) = 0

Fig. 9
figure 9

Correlation between age and log gamma-glutamyltranspeptidase (gamma-GTP) level in female subjects was analyzed. a The correlation was analyzed only in female nondrinkers. b The correlation was analyzed only in female periodic drinkers. c The correlation was analyzed only in female everyday drinkers. The test of correlation was performed with the null hypothesis of correlation coefficient (r) = 0

Therefore, gamma-GTP level was positively correlated with age independent of gender and alcohol-drinking habit.

Discussion

A scan of 4,190 SNPs in 199 different genes using 38 different quantitative traits to search for associations between genotypes and phenotypes detected an association between rs671 genotypes and gamma-GTP levels. P values, however, were far above the significance level after Bonferroni’s correction. Since the association between ALHD2 genotypes and gamma-GTP levels has already been reported (Takagi et al. 2001) and is supported by functional evidence (Impraim et al. 1982), the association is likely to be true. The results of this study indicate how difficult it is to detect associations from thousands of SNPs and many quantitative traits considering the multiple-comparison problem. However, replication studies may successfully identify new associations between genotypes and phenotypes.

Even if we confirm the association between an SNP and a phenotype, it is not necessarily a direct association. However, although difficult, it is important to decide which association is direct and which is not. Theoretical consideration is sometimes useful; however, careful statistical analyses of the obtained data may help deepen the understanding of the associations.

We first examined the associations between pairs of five factors: rs671 genotype, alcohol-drinking habit, gender, gamma-GTP level, and age. Surprisingly, statistical associations were observed in all pairs of five factors except for the pair of rs671 genotype and gender and rs671 genotype and age (Fig. 2). It is unlikely, however, that all apparent associations between the pairs of factors reflect direct associations.

Our logistic regression analyses, by making the alcohol-drinking habit the criterion variable and rs671 genotype, gender, and age the explanatory variables, have shown that all three explanatory variables are associated with the alcohol-drinking habit. Linear multiple regression analyses, by making gamma-GTP level as the criterion variable and age, gender, alcohol-drinking habit, and rs671 genotype as explanatory variables, have indicated that all explanatory variables are associated with gamma-GTP level. However, more precise analyses were necessary to examine whether the associations are independent of other factors.

Subgroup analysis was very useful to examine whether an association of a pair of two variables was independent of the factors used for the stratification. In this way, the associations between rs671 genotype and alcohol-drinking habit, alcohol-drinking habit and gamma-GTP level, gender and gamma-GTP, and age and gamma-GTP were judged independent of other factors. Conversely, the association between rs671 genotype and gamma-GTP level was considered dependent on the alcohol-drinking habit. Although the direct associations between rs671 genotype and alcohol-drinking habit and between alcohol-drinking habit and gamma-GTP level are supported by functional evidence and previous reports, the associations between gamma-GTP level and gender and gamma-GTP level and age are of interest. The associations between gamma-GTP and age (Puukka et al. 2006) and gamma-GTP and gender (Stewart et al. 2007) have been previously reported.

The associations between genotypes and multiple phenotypes are often very complicated. Previous studies have shown the association between ALDH2 genotypes and hypertension (Tsuritani et al. 1995). In addition, gamma-GTP levels have been reported to be associated with hypertension (Takagi et al. 2001; Shankar and Li 2007), cardiovascular event (Fraser et al. 2007b; Patel et al. 2007; Yang et al. 2007; Hozawa et al. 2007; Turgut et al. 2006; Lee et al. 2006), type 2 diabetes (Doi et al. 2007; Fraser et al. 2007a; Lim et al. 2007), obesity (Lim et al. 2007), adiponectin (Kim et al. 2007), hyperlipidemia (Lippi et al. 2007), and metabolic syndrome (Andre et al. 2007; Lee et al. 2007). Although most of these associations are thought to be indirect and dependent on other factors such as alcohol-drinking habit, careful analyses of the data from the statistical viewpoint are necessary.

Associations between genetic and environmental factors will become a focus of medical and biological studies, since recent genome-wide association studies are expected to identify many association candidates (Ozaki et al. 2002; Suzuki et al. 2003; Kochi et al. 2005; The Wellcome Trust Case Control Consortium 2007). Our study reported here shows that (1) a large sample size combined with a replication study is necessary to overcome the multiple-comparison problem, and (2) subgroup analysis along with logistic and linear multiple regression analysis may be useful to dissect the complicated relationship.