Introduction

Schizophrenia (SCZ) and bipolar affective disorder (BPAD) are severe psychiatric disorders, each affecting approximately 1% of the population worldwide.1 Family, twin and adoption studies confirm that genetic factors have a significant role in the aetiology of these disorders.2 Several putative susceptibility loci and genes have been implicated, of which the TSNAX/DISC1 locus has shown encouraging results. The gene disrupted in schizophrenia 1 (DISC1) was first identified in a large Scottish family with a balanced translocation t (1, 11) (q42.2, q14.3) segregating with major mental illness, including SCZ, BPAD and recurrent major depression.3, 4, 5 The translin-associated factor X (TRAX; TSNAX) gene is located adjacent to the DISC1 towards the 5′ region.

Subsequently several linkage studies have observed linkage near 1q42, which harbours DISC1, to SCZ6, 7, 8, 9 and BPAD.10, 11, 12 Several population-based and family-based association studies have supported the role of the TSNAX/DISC1 locus in SCZ and/or BPAD in Caucasian13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25 and Asian26, 27, 28 populations. However, other studies have failed to replicate association between TSNAX/DISC1 and SCZ/BPAD.29, 30, 31, 32, 33, 34 Recent meta-analysis found no evidence of association with SCZ with the variants in the DISC1.35 A meta-analysis of nine European studies has supported common SCZ risk interval within DISC1 intron 4–6 with P=0.002, odds ratio =1.27.24 There is also evidence for DISC1 being associated with poorer performance of visual attention and working memory.16, 26, 36, 37, 38 Association of DISC1 with psychosis-related traits has been reported in Finnish population.39 A genome-wide association study has shown nominal association for SCZ with the SNPs near the DISC1 translocation break point.40 These results support a role for TSNAX/DISC1 genetic variation in conferring susceptibility to both SCZ and BPAD. Variation at the TSNAX/DISC1 region thus may have a role in susceptibility to severe psychiatric illness, though the risk variants are not clearly defined, and the haplotypes vary across populations.

Several reviews of the genetic epidemiology and molecular linkage studies suggest an overlap between genetic susceptibility to SCZ and BPAD,41, 42, 43 indicating that these two diagnostic categories may share risk factors. The TSNAX/DISC1 locus could be one of these overlapping genetic susceptibility regions for psychosis. This is also supported by the observation that the translocation was found in both BPAD and SCZ phenotypes in the Scottish family, and a previous study13 in this region has shown association with a combined phenotype of SCZ and BPAD, along with schizoaffective disorder, SCZ spectrum diagnosis.

The full-length human DISC1 protein (854 amino acids) is predicted to consist of an N-terminal region, often referred to as the globular ‘head’ domain and an alpha-helical coiled-coil-containing C-terminal region.4, 44 DISC1 protein interacts with several proteins, which are involved in neuronal migration, cytoskeleton associated movements, cell cycle, signal transduction, intracellular transport/exocytosis, golgi and central nervous system development.45,46,47 The TSNAX (290 amino acids) protein has an important role in brain development by forming brain-enriched complexes with the protein, Translin.48 There is evidence of intergenic splicing and formation of fusion transcripts between TSNAX and DISC1. Five species of transcripts resulting from intergenic splicing have been identified, of which one encodes a novel TSNAX/DISC1 fusion protein.49

Association between polymorphisms in TSNAX/DISC1 and the risk of developing SCZ and BPAD in the Southern Indian population has not been evaluated before, and the present study was conducted to detect the associations reported in other populations. A case–control design was used in order to test the association of SNPs between cohorts of SCZ cases, BPAD cases, cases with a combined phenotype (SCZ or BPAD) and healthy controls. As there were previous reports13, 15, 27 of sex-specific associations, we also analysed the data based on the gender.

Materials and Methods

Subjects

All individuals included in this study were unrelated and of Southern Indian ethnic origin. The study sample comprised of 419 patients with diagnosis of BPAD type I (225 males and 194 females; mean age at assessment: 29.02 ±10.98 years, mean age at onset: 20.73 ±6.96 years) and 408 patients with diagnosis of SCZ (246 males and 162 females; mean age at assessment: 29.08 ±7.84 years, mean age at onset: 23.21 ±7.16 years) recruited from the inpatient and outpatient services of National Institute of Mental Health and Neuro Sciences (NIMHANS), Bangalore, India. All patients were evaluated using the Diagnostic Interview for Genetic Studies.50 Patients were diagnosed by at least two experienced Psychiatrists using Diagnostic and Statistical Manual of Mental Disorders-IV (DSM-IV) criteria.51 Control subjects (N=425; 243 males and 182 females mean age: 33.61±11.15 years) were recruited by a standardized semi-structured interview with psychiatrists from healthy volunteers and hospital staff who had no family history of axis I disorders, in first degree relatives.52 Cases and controls were statistically matched for gender (Supplementary Table 1).

The study was approved by the Institute ethics committee. All participants gave written informed consent after a complete description of the study.

SNP Selection and Genotyping

Seven SNPs spanning 492 kilo bases from intron 4 of the TSNAX to the 3′-untranslated region of the DISC1, which have shown significant association with SCZ and/or BPAD in other populations, were selected for the study (Figure 1). In all, 10 ml of venous blood was collected from the participating subjects and genomic DNA was isolated using modified salting-out method. 53 The SNPs were genotyped in two sets using the ABI PRISM SNaPshot Multiplex Kit (Applied Biosystems, Foster City, CA, USA). PCR was carried out using an ABI 9700 (Applied Biosystems)/eppendorf thermocycler (Eppendorf Scientific, Inc., Westbury, NY, USA). Template preparation after PCR was performed using shrimp alkaline phosphatase and exonuclease I and post-extension treatment was done using shrimp alkaline phosphatase. SNaPshot products were analysed using an ABI PRISM 377 DNA Sequencer (Applied Biosystems). Genotypes were determined using Gene Scan v 3.1 and Genotyper v 2.1 software (Applied Biosystems). Allele scoring was performed blind to diagnosis, and two raters confirmed the scoring. The genotyping call rate was 97.8%. Conflicts in allele scoring were resolved by individual SNP genotyping.

Figure 1
figure 1

Schematic diagram of exons of TSNAX (grey boxes), the disrupted in Schizophrenia 1 (DISC1) (black boxes), and the intergenic exons (white boxes). The seven SNPs studied are shown in the context of the exonic structure of these genes. The positive reports are shown with inter-marker distance. The figure is adapted from Hennah et al. 2003.

As genetic case–control studies are known to be susceptible to population stratification, we genotyped six unlinked Alu polymorphisms in 100 individuals from each category (N=300) to assess the genetic homogeneity of our samples. Alu repeats are a family of short interspersed elements and are predominantly present in the non-coding regions of the primate genome. Alu repeats have no known biological function54 and have been used to study the history of substitution patterns in the human genome.55 Polymorphic Alu insertions have been used as markers to determine genetic distances between human sub-populations.56 We did not observe any difference in the allelic or genotypic distribution between the samples, or between the sexes (Supplementary Table 2).

Statistical analysis

Mann–Whitney test was performed to check for age differences between cases and controls. The difference in age between cases and controls was found to be statistically significant. Binary logistic regression analysis (SPSS version 15, SPSS Inc., Chicago, IL, USA) was done with genotypes recoded into the number of alleles (0, 1 and 2, with 1 being the heterozygote) and recoded genders (1 and 2, 1 being male) considering age as a covariate. Allele and genotype frequencies were also compared using standard Chi-square (χ2) tests. Haplotype analysis was performed using HAPLOVIEW v 4.0 (Broad Institute, Cambridge, MA, USA), which uses the expectation maximization algorithm57 to estimate haplotype frequencies. Hardy–Weinberg equilibrium was assessed for all markers using HAPLOVIEW v 4.0. In accordance with the common variant common disease hypothesis for complex disorders, rs6675281 and rs11122396 were excluded from further analysis as the minor allele frequencies were found to be <5% in controls. The frequency of rs11122396 has been seen to be quite low (2%) also in Caucasians15 and that of rs6675281 was higher (12 to 14%) in Caucasians and Asian studies compared with our study.14, 15, 20, 24, 58 rs1615409 was also excluded from the analysis as it was in strong linkage disequilibrium with rs766288 (controls-D′=0.95, r2=0.606; SCZ-D′=0.962, r2=0.562 and BPAD-D′=0.969, r2=0.566). The linkage disequilibrium structure for the four SNPs studied is given in Supplementary Figure 1. Association tests with p-values<0.05 were considered to be nominally significant. We corrected for multiple tests using false discovery rate (FDR) approach.59 We used an overall FDR of 0.05 for our association tests. As we performed tests for four SNPs, we used the FDR correction for four tests for the single-marker analyses. The corrections were done within each gender and phenotype separately. For two loci analysis, a combination of ‘allele 1’ in rs766288 and ‘allele 2’ in rs2812393 was considered the risk haplotype for regression analysis. The number of copies of the risk alleles was coded as 0, 1 or 2. For example for C–C risk combination, C/C–C/C coded as 2, C/C–C/G or C/A–C/G or C/A–C/C coded as 1 and C/C–G/G, C/A–G/G, A/A–G/G, A/A–C/C and A/A–C/G coded as 0. As there were four allelic combinations in each of the two-locus analysis, the FDR corrections were implemented for four tests. The corrections were done within each gender and phenotype separately. Power calculations were performed by using QUANTO 1.2 software (http://hydra.usc.edu/gxe). The power of our study sample was calculated based on the observed range of allele frequencies, the odds ratio of 1.2, prevalence of 1%, a 1:1.041 case–control ratio for BPAD and a 1:1.014 case–control ratio for SCZ under a log additive genetic model at the 0.05 significance.

Results

Single-Marker analysis

SNP frequencies for cases and control groups did not deviate significantly from the Hardy–Weinberg equilibrium cutoff (P<0.001). One SNP (rs821616) showed moderate significant association (P<0.05) with cases compared with controls in regression analysis (Table 1). Associations were found to be significant with BPAD phenotype (P=0.035) and combined phenotype of BPAD or SCZ (P=0.04). However, the results did not retain statistical significance after multiple corrections at level 0.05 (n=4). As there were reports of gender-specific effects we analysed the samples stratified by gender and significant association was found at rs766288 and rs2812393 for females (Table 1). The most significant association was found for rs766288 with BPAD females (P=0.004) and combined female cases of BPAD or SCZ (P=0.004), which withstood FDR corrections (n=4). Moderate associations were also observed in females for rs766288 (SCZ, P=0.03) and rs2812393 (SCZ, P=0.026; BPAD+SCZ, P=0.028). Binary logistic regression analysis for the combined sample comprising both males and females at rs766288 did not show any statistical significance after adjusting for gender (Supplementary Table 3). This could be due to the difference in allele frequencies in males (0.319) and females (0.371) at rs766288. However, there was significant evidence of genotype–gender interaction (Supplementary Table 4).Results of χ2 tests and odds ratio is shown in Supplementary Table 5 and Supplementary Table 6.

Table 1 Logistic regression analysis—genotype+age as covariate

Haplotype analysis

The haplotype tests of association were performed with all four-marker haplotypes. Haplotypes with estimated frequencies of >1% in both cases and controls were considered for analysis. Results of haplotype analysis with P<0.05 is shown in Table 2. C–G–G–T haplotype showed relatively lesser frequency in BPAD (P=0.028), SCZ (P=0.0189) and combined cases of BPAD or SCZ (P=0.0065) and combined males of BPAD or SCZ (P=0.0205) compared with controls. Haplotype A–G–G–T was also underrepresented in combined cases of BPAD or SCZ (P=0.0469), BPAD females (P=0.0273) and pooled females of BPAD or SCZ (P=0.0209). The nominal associations found did not remain significant after multiple corrections at level 0.05.

Table 2 Results from association analysis for four-marker haplotypes in TSNAX/DISC1

Two-locus analysis

Two-locus analyses were performed using binary logistic regression for the three SNPs in combinations of rs766288–rs2812393, rs766288–rs821616 and rs2812393–rs821616 as we found nominal associations for these loci.

rs766288–rs2812393

Analyses of these two SNPs showed significant associations with allelic combination C–C (P=0.009) in the BPAD phenotype when multiple corrections were performed for all tests within each stratified gender (combined, females, males). The regression coefficient for the C–C combination was positive indicating that this allelic combination increases the risk in BPAD. Female-specific combinations were also observed (Table 3). Other combinations in BPAD females (C–C, P=0.003, risk combination) and SCZ females (C–C, P=0.011, risk combination; A–G, P=0.016, protective combination), combined female cases (A–G, P=0.013, protective combination; C–C, P=0.001, risk combination) were found to be significantly associated after FDR corrections (n=12). If FDR corrections are performed only within a phenotype and gender-based analysis (n=4), we also obtained significance with C–C combination (P=0.016, risk combination) in BPAD cases.

Table 3 Two locus analysis of rs766288 and rs2812393

rs766288–rs821616

Allelic combination C–A (P=0.004) in the female cases with combined phenotype showed significant association when multiple corrections were performed for all tests within each stratified gender (combined, females, males). A positive regression coefficient value for the C–A combination was suggestive that female cases with this allelic combination are at increased risk. Few additional combinations that were found to be significantly associated after multiple corrections (n=12) were A–T in BPAD females (P=0.012, protective combination); C–A (P=0.014, risk combination), C–A in SCZ females (P=0.011, risk combination), A–T in combined female cases (P=0.012, protective combination) and A–A in combined female cases (P=0.022, protective combination) compared with control females (Table 4). If FDR corrections are performed only within a phenotype and gender-based analysis (n=4), we also obtained significance with A–A (P=0.032, protective combination) in BPAD females. However nominal significance observed in BPAD (C–A, P=0.024, risk combination), cases with the combined phenotype (C–A, P=0.019, risk combination), SCZ males (A–A, P=0.016, risk combination) and males with the combined phenotype (A–A, P=0.028, risk combination), which did not withstand the multiple corrections.

Table 4 Two locus analysis of rs766288 and rs821616

rs2812393–rs821616

Allelic combinations of G–T in SCZ (P=0), combined cases of BPAD or SCZ (P=0.001), SCZ females (P=0.007) and combined females (P=0.005) were observed to be protective combination of alleles (Table 5). These combinations survived multiple corrections (n=12). In addition allelic combinations of G–T (BPAD, P=0.017; BPAD females, P=0.035; SCZ males, P=0.02; combined males, P=0.039) showed nominal significance, but did not withstand FDR corrections (n=4).

Table 5 Two locus analysis of rs2812393 and rs821616

Discussion

In the present study, we observed nominal significance in the entire sample set for a non-synonymous SNP rs821616 (Ser704Cys) with BPAD and a combined phenotype of BPAD or SCZ. However these results did not withstand multiple corrections.

At the SNP rs766288, gender-specific association was observed which remained significant after multiple corrections. Female-specific associations in DISC1 have been reported earlier for SCZ at rs2295959 in Han Chinese27 and rs821616 in a Japanese population.60 An early study on a Finnish cohort with a broad phenotype, which included SCZ, schizoaffective disorder, SCZ spectrum diagnosis and BPAD, reported the under-transmission of a haplotype that comprised of rs751229 and rs3738401 at the 5′end of DISC1 to affected females.13 HEP2 (rs1615344–rs1615409–rs766288) (C–A–C) also showed significance in combined samples of males and females and also under-transmitted when samples were stratified by gender. Interestingly rs766288 showed significance in an 8-SNP haplotype in the TSNAX/DISC1 region of G–C–C–G–A–C–A–G (rs1630250–rs1615409–rs766288–rs1025526–rs1030711–rs751229–rs1285730–rs3738401) with BPAD males, in the Scottish population.15 In our study the C allele of rs766288 was overrepresented in BPAD females (72.9%), SCZ females (71%) and combined females of BPAD or SCZ (72.1%) as compared with female controls (62.9%). Further in our study C–A (rs766288–rs821616), A–A and A–T allelic combinations showed association with female cases. C–A seems to be a risk combination while A–A and A–T are likely to be protective. The two other two-locus analysis involving rs766288 also showed C as risk haplotype. C–C (rs766288–rs2812393) allelic combination seemed to be risk factor in BPAD, which was severe in females. Our results for rs766288 are in concordance with the previous reports and suggest that C might be the risk allele.

We also observed nominal associations for rs2812393. Two-locus analysis of rs2812393–rs821616 showed G–T as protective combination of alleles in gender combined samples of SCZ, BPAD or SCZ, as well as in females. C–C (rs766288–rs2812393) appeared to be a risk combination. Thomson et al.15 have reported an overrepresentation of G allele of rs2812393 in a three marker haplotype C–G–T (rs2492367–rs2812393–rs1322784) in BPAD and also in BPAD female subjects from Scotland. rs766288 and rs2812393 lie in intron 4 of TSNAX and intron 6 of DISC1 gene. As TSNAX and DISC1 are known to form fusion transcripts, these SNPs could be involved in the formation of isoforms via alternative splicing or could be associated with some other functional variant in the region. The mechanism of sex-specific effects of these genes could be through variation in hormone responsive promoter or enhancer sequences, affecting either TSNAX/DISC1 or other genes that interact with them.

Our results for Ser704Cys polymorphism in the complete sample set is in accordance with the report in a Caucasian sample with allele A overtransmission (71%) for SCZ.16 We also observed a relative excess of the A allele (Ser704) in cases (77.2% SCZ, 76.5% BPAD, 76.84% combined cases) compared with controls (72.2%). Discordant findings have been observed for this polymorphism with an excess of T allele in SCZ in a Chinese sample set (15.7% in cases and 10.4% in controls)28 and in another Caucasian sample (33% for cases and 27% for controls).20 These differences suggest that there could be allelic heterogeneity associated with the polymorphism. This SNP is in exon 11 of DISC1, an exon known to undergo alternative splicing and encodes a region of the protein thought to be involved in interactions with ATF4, ATF5, LIS1, GRB2 and NDEL1.45 The Ser allele has been shown to be associated with structural and functional changes in the hippocampus including reduction in grey matter volume in Caucasian data set.16 Ser704 homozygotes have shown greater pre-frontal activation, with significant differences in the left middle and inferior frontal gyri, and in the right inferior frontal gyrus, which could be interpreted as a manifestation of less efficient pre-frontal function, with more activation required to achieve the same behavioural output.61 Ser704Cys polymorphism is also shown to be associated with altered expression of transcripts lacking exon 3 and, exon 7 and 8.62 The short transcripts were found to be highly expressed in the brains of patients with SCZ than in control brains, and were associated with genetic variations in the DISC1 gene. Power analysis showed our sample set had power of 42.4% for BPAD and 42.6% SCZ for rs766288, 45.5% for BPAD, 44.1% for SCZ for rs2812393 and 40.5 and 35.4% for SCZ for rs821616 with an odds ratio of 1.2. These nominal associations observed in our sample could be a false positive finding due to inadequate statistical power. The possibility of involvement of Ser704Cys polymorphism in the susceptibility of psychosis still remains unclear.

The associations we have identified are unlikely to have arisen as a result of population stratification within our samples-firstly the allele frequencies of the analysed SNPs other than rs6675281 are congruent with the reported allele frequencies in Caucasian populations and secondly, the Alu markers did not show significant differences in the groups studied (Supplementary Table 2). However, these data should be interpreted with caution as there are chances of type I error.63

We could not replicate the association for the other SNP studied rs3738401 with SCZ or BPAD that have been reported previously in other ethnic groups. The difference in the ethnic background of the subjects, and the study design viz sample size, sampling bias, family vs population based, and so on between studies may account for inconsistent results. These reported SNPs might not be causative, and could be in linkage disequilibrium with some other disease predisposing allele within the gene or near the gene. The SNPs studied could be associated with SCZ or BPAD in a haplotype with other polymorphisms or could be a susceptibility allele only in subgroup of the phenotype. It is evident that most of the positive studies in this region have been from Caucasian populations, while findings in the Asian population have been mixed, with largely negative reports.37, 38, 39 This could be explained by the differences in the linkage disequilibrium pattern and differences in allele frequencies across populations. Alternatively, in complex diseases such as SCZ and BPAD, in which multiple loci act jointly to confer susceptibility for disease, associations of different alleles at the same locus could be attributed to a correlation with a causal polymorphism at another locus.64 However our results for rs766288, rs2812393 and rs821616 are suggestive of the involvement of this locus in susceptibility to BPAD and SCZ.

The present study has some limitations. Firstly, the number of SNPs studied might not represent the entire region, as TSNAX/DISC1 region comprises 610 kb. Thus, we cannot exclude the possibility of other SNPs in the region being involved in conferring risk for BPAD and SCZ. Second, the sample size for the current analyses was relatively modest, with further reduction for gender-based comparisons, and further replication is necessary.

However, it should also be noted that the multiple corrections applied here are very conservative taking into account our study design has a directional hypothesis based on previous findings. Despite the limitations, we believe our study contributes to greater insight into the TSNAX/DISC1 locus association with psychosis. In an era where genome-wide association study have rapidly become a standard method for disease gene discovery, there is still a need for focused candidate gene association studies in different populations, as most genome-wide association study have been with European sample sets, and limited reports are available from Asian populations,65,66,67 as there are likely to be population-specific variants.68

In conclusion, we find significant gender-specific associations with two intronic SNPs in TSNAX and DISC1, and an exonic SNP from DISC1. Further replication, in larger and more diverse samples, using dense markers, may be necessary to understand the role of these genes in the pathophysiology of these disorders.