Introduction

Disturbances in the dopamine neurotransmitter system have long been suggested to play a crucial role in the pathogenesis of schizophrenia (Prasad et al. 2002). However, involvement of dopamine-related genes in the manifestation of this disorder has been difficult to determine. The dopamine D4 receptor (DRD4) gene, located at chromosome 11p15.5 (Gelernter et al. 1992; Petronis et al. 1993), has received considerable interest because clozapine, a neuroleptic which is often effective against treatment-resistant symptoms, has a high affinity for this receptor (Sanyal and Van Tol 1997; Van Tol et al. 1991). Also, DRD4 is up-regulated in postmortem brain tissue from schizophrenic patients (Murray et al. 1995; Seeman et al. 1993; Stefanis et al. 1998; Sumiyoshi et al. 1995).

To investigate its genetic association with schizophrenia, many groups have examined polymorphisms of the DRD4 gene. Early studies focused on a 48-bp variable number of tandem repeat (VNTR) in exon 3. This tandem repeat comprises 2 (2R) to 11 (11R) 48-bp repeat units and codes for the third intracellular loop of the receptor protein (Van Tol et al. 1992). There is growing genetic evidence linking the VNTR with psychiatric traits or illnesses, such as novelty-seeking (Benjamin et al. 1996; Ebstein et al. 1996), attention-deficit hyperactivity disorder (ADHD) (Faraone et al. 2001; LaHoste et al. 1996), and schizophrenia (Hwu et al. 1998; Jonsson et al. 1996; Kaiser et al. 2000; Sanak et al. 2005; Serretti et al. 2001), but the results are still variable.

In recent years, promoter region polymorphisms of the DRD4 have also received particular attention because of their possible role in the regulation of gene transcription. To date, two polymorphisms in the promoter, T allele of the single nucleotide polymorphism (SNP) −521T>C and a 120-bp tandem duplication (1.2 kb upstream from the initiation codon), are reported to reduce transcriptional efficiency (D’Souza et al. 2004; Okuyama et al. 1999). Numerous studies have examined the association of upstream (promoter region and exon 1) polymorphisms with schizophrenia and some have shown a positive association (Okuyama et al. 1999; Xing et al. 2003), although not all were in full agreement (Hong et al. 1998; Jonsson et al. 2001; Kohn et al. 1997; Lung et al. 2006; Mitsuyasu et al. 2001; Petronis et al. 1995).

Since previous studies focused primarily on individual polymorphisms, allelic heterogeneity and/or a relatively weak effect of the variants may partly explain these inconsistencies. When haplotype construction is difficult because of a weakness or absence of linkage disequilibrium (LD) in a genomic interval, as is the case for the promoter region of DRD4, it may be important to test for a synergistic interaction between genetic variants in addition to performing single SNP analysis. Haplotype analysis is a reasonable strategy, assuming that the causative allele arises from a specific ancestral haplotype. However, this method is inadequate when there is an accumulation of multiple causative variants in a restricted genomic stretch. Additionally, haplotypic analysis is further compromised in regions where LD is decayed, because of the low statistical power to detect association due to inflated degrees of freedom. Therefore, clusters of potentially functional variants in genomic region with tenuous LD are better analyzed using a logistic regression framework that provides a powerful test of SNP etiology.

In this study, we examined the 5′ upstream region of DRD4, by first re-sequencing the region to identify all genomic variants. We then tested the association of all identified/validated common variants individually, as well as performing synergistic interaction analyses between them and schizophrenia.

Materials and methods

Subjects

Samples from 570 unrelated cases of schizophrenia (285 men, 285 women; mean age 47.0±11.4 years), and 570 age and sex matched controls (285 men, 285 women; mean age 46.7±11.1 years) were analyzed. The diagnosis of schizophrenia was made by consultation according to DSM-IV criteria with consensus from at least two experienced psychiatrists. All available medical records were taken into consideration. Control subjects were recruited from hospital staff and volunteers who showed no evidence of present or past psychoses during brief interviews with psychiatrists. All subjects were from central Japan. The study was approved by the Ethics Committees of RIKEN, Hamamatsu University and Chiba University. All participants provided written informed consent.

Sequence analysis

The upstream region encompassing the promoter region and exon 1 of the DRD4 gene (from 1,216 bp upstream of “A” in the start codon to 293 bp downstream of this “A”) was analyzed by the direct sequencing of PCR amplification from the genomic DNA of 30 unrelated Japanese schizophrenics. Primer sequences and detailed information on the reaction conditions are available upon request. Sequencing was performed using the DYEnamic ET terminator cycle sequencing kit (Amersham Biosciences, Piscataway, N.J., USA) and the ABI PRISM 3730 Genetic Analyzer (Applied Biosystems, Foster City, Calif., USA). Polymorphisms were detected by the SEQUENCHER program (Gene Codes Corporation, Ann Arbor, Mich., USA).

Genotyping

SNPs were typed by the TaqMan system (Applied Biosystems). PCR was performed using an ABI 9700 thermocycler and fluorescent signals were analyzed by an ABI 7900 sequence detector single point measurement and SDS v2.0 software (Applied Biosystems). Conflicts or flagged alleles were resolved by re-genotyping. Two microsatellite marker loci, the 120-bp tandem duplication and the 12-bp repeat polymorphism (Fig. 1), were amplified by PCR using fluorescently labeled primers. PCR fragments were analyzed on an ABI PRISM 3730 Genetic Analyzer (Applied Biosystems). Genotypes were determined using GeneScan 3.5.2 and Genotyper 3.6 software (Applied Biosystems).

Fig. 1
figure 1

Genomic structure and location of polymorphic sites within DRD4. Exons are denoted by boxes, with untranslated regions in white and translated regions in black. The sizes of exons and a scale are also shown. SNPs and microsatellites are indicated by solid squares and triangles, respectively. Re-sequencing was performed on the upstream region (from 1.2 kb upstream of the initiation codon to the 3′ end of exon 1)

Statistical analyses

The allelic and genotypic distributions were tested for association by Fisher’s exact test for biallelic markers (SNPs, the 120-bp tandem duplication and the 12-bp repeat polymorphism).

Haplotypic association analysis was performed using COCAPHASE in the UNPHASED package (http://www.rfcgr.mrc.ac.uk/fdudbrid/software/unphased/) (Dudbridge 2003). We employed a 2- and 3-marker sliding window analysis using COCAPHASE. To estimate the degree of LD between pairs of loci, the standardized disequilibrium coefficient (D′) was calculated and haplotype blocks were defined using the Haploview program (http://www.broad.mit.edu/mpg/haploview/) (Barrett et al. 2005).

Logistic regression analysis

To test the multiple polymorphisms in the upstream region for a synergistic effect, stepwise logistic regression analysis was performed using the SPSS Software (Release 11.0J) (SPSS Japan, Tokyo, Japan). We employed a procedure of backward selection in which we started with all the markers genotyped in the present study. According to the approach of Cordell and Clayton (2002), we set dummy variables, x 1 and x 2, taking values x 1=−1, 0 and 1, x 2=−0.5, 0.5 and −0.5, for genotypes w/w, w/m and m/m, respectively (w: wild-type, m: mutant).

Results

We re-sequenced the upstream region of the DRD4 gene, and identified a total of eleven SNPs: −1106T>C, −930C>G, −906T>C, −873G>A, −809G>A, −616G>C, −603G>T, −595G>del, −521T>C, −376C>T, and −291C>T, and a 12-bp repeat polymorphism (Fig. 1). One novel variant, −930C>G (deposited into GenBank as ss61570833), was detected in only 1 out of 58 chromosomes. The remaining variants were already present in the public databases and were validated in Japanese subjects during the course of this study.

The eight markers, −1106T>C, −906T>C, −809G>A, −616G>C, −521T>C, −376C>T, −291C>T, and the 12-bp repeat polymorphism in exon 1, were selected for genotyping using the criterion of a minor allele frequency >0.05 from our sequencing data. We also typed the size of the 120-bp tandem duplication (bi-allelic polymorphism with either one or two repeat alleles). The information on markers is shown in Table 1. The 12-bp repeat polymorphism was mostly bi-allelic with the exception of four chromosomes (three from schizophrenics and one from controls) that contained a three-repeat allele.

Table 1 Genotype distributions of bi-allelic polymorphisms from the upstream region of DRD4

Allelic and genotypic distributions of the promoter SNP −809G>A differed significantly between cases and controls (P=0.018 and 0.032 for allelic and genotypic comparisons, respectively). In the exploratory haplotype window analysis, the two-SNP haplotype [(−1106C) − (−906C)] tended to be over-represented in schizophrenia (frequencies in cases and in controls were 0.70 and 0.66, respectively, P=0.0656) (data not shown).

In LD analysis, D′ values were close to 1 in the distal [(120-bp repeat) − (−1106T>C) − (−906T>C)] and proximal [(−376C>T) − (−291C>T) − (−12-bp)] segments of the upstream region, with each defined as a haplotype block, but it did not hold for the region in between (Fig. 2). Decay of LD in this short stretch (1.2 kb) may imply a high local recombination rate. Therefore, in addition to conventional haplotype-window analysis, we employed logistic regression analysis, which can test a combinatorial effect of multiple SNPs simultaneously. A combination of four polymorphisms (−809G>A, −616G>C, −219C>T and 12-bp repeat) appeared to affect susceptibility to schizophrenia (Table 2).

Fig. 2
figure 2

Haplotype organization of DRD4 in Japanese. The haplotype block pattern was constructed by the Haploview program using the genotype data from both case and control samples (1,140 subjects). The number in each cell represents the LD parameter D′ (×100), blank cells mean D′=1. Each cell color is graduated relative to the strength of LD between markers, which is defined by both the D′ value and confidence bounds on D′. Strong LD was observed in the discrete distal [(120-bp repeat) − (−1106T>C) − (−906T>C)] and proximal [(−376C>T) − (−291C>T) − (12-bp repeat)] segments, but not in the intervening region

Table 2 Logistic regression analysis in the upstream region of DRD4

Discussion

In this study of the DRD4 gene, we focused on the upstream region, where associations with psychiatric phenotypes have been previously reported. However, genetic variants in this region do not show consistent association with schizophrenia. This may have been due to insufficient statistical power or inadequate interrogation of genomic variations in some studies. In addition, the differences in marker sets between studies increase the ambiguity when interpreting data. To minimize these problems, we adopted the thorough approach of re-sequencing and genotyping all validated polymorphisms.

Both upstream region variants, the single marker −809G>A and a multi-marker combination, displayed association with schizophrenia in this study. The SNP −521T>C, which was previously reported to show association with schizophrenia in an independent Japanese sample set (Okuyama et al. 1999), was not significant in our data set, as in other population data sets (Ambrosio et al. 2004; Jonsson et al. 2001; Segman et al. 2003). Since it is conceivable that particular combinations of multiple variants may confer an enhanced susceptibility to schizophrenia, we performed a logistic regression analysis to test all SNPs in the upstream region for association. This approach may be more appropriate than conventional haplotype analysis, when examining the upstream region of the DRD4 gene where there is evidence of LD decay in a relatively short genomic stretch and most markers do not correlate with each other. Results appear to indicate a synergistic interaction among the four promoter polymorphisms in the DRD4 gene. These results should be treated with caution since there are limitations to the present association study. Firstly, this is a case-control study, where false positive findings due to population stratification can occur. Analysis of this sample set using STRUCTURE software (Pritchard et al. 2000) detected no evidence of population stratification (Shimizu et al. 2006). To address this issue further, we also performed a transmission disequilibrium test. This testing failed to replicate the association of −809G>A in a sample set of 80 complete schizophrenic trios (data not shown). This sample size would have only a limited statistical power to detect true association. Secondly, the significance of the −809G>A polymorphism in the promoter region disappeared after Bonferroni correction for multiple testing. Thirdly, the sample size is obviously limited and so the insignificant result for −521T>C needs to be interpreted cautiously. The sample size in this study had a power of 0.59 to detect a susceptibility variant with a relatively small effect (relative risk = 1.2 and 1.44 for heterozygotes and homozygotes, respectively), when the allele frequency is 0.41. This frequency equals that of SNP −521C, whose association with schizophrenia has been previously reported (Purcell et al. 2003; Xing et al. 2003).

In conclusion, to our knowledge, this is the first study that highlights a possible combinatorial effect of promoter SNPs in the DRD4 gene. It will be interesting to determine whether this multi-marker association can be confirmed in independent data sets, where samples are well-defined in terms of clinical variables such as symptoms, age-at-onset, severity, medication type, and response to antipsychotic treatment. Furthermore, it will be important to examine the functional effects of variant combinations, to understand the mechanisms by which they increase or decrease susceptibility to schizophrenia.