Introduction

Bipolar disorder (BP) is among the more reliably diagnosed1 and most heritable illnesses in psychiatry.2 Although a recent large-scale genome-wide association study (GWAS) meta-analysis found strong evidence for association of common markers in the calcium channel subunit gene CACNAC1 and the cell-surface protein gene ODZ4 with BP, the effect sizes are modest and account for only a small proportion of the phenotypic variance and heritability.3 Several hypotheses have been proposed to explain why gene identification in psychiatric phenotypes such as BP has met with less success than might have been anticipated. Among these are a higher burden of rare variants,4 a more polygenic genetic architecture5 and a more heterogeneous phenotype.6

If the search for etiology is complicated by phenotypic heterogeneity, a potential method to mitigate this problem might be to delineate clinical subphenotypes that could be more biologically homogenous. Such phenotypes may also help uncover potential susceptibility risk factors for domains of psychopathology that may be shared across diagnoses. In this study, we focus on the Diagnostic and Statistical Manual of Mental Disorders (DSM)-IV subtype of BP with mood-incongruent psychotic features (MICP). This subphenotype was first described a century ago as a potential diagnostic tool to differentiate the more common mood-congruent psychotic symptoms of mood disorders from the mood-incongruent psychotic symptoms of SZ.7 More modern conceptions of BP no longer exclude cases with mood-incongruent psychotic symptoms, which are now believed to comprise one-third to one-half of all psychotic BP.8, 9 Mood-incongruent psychotic symptoms in BP appear to be important markers of severity, being associated with worse overall outcome,10, 11 poor lithium responsiveness12 and higher rates of attempted suicide.8, 9

The presence of MICP symptoms might also be a manifestation of overlapping genetic susceptibility between BP and SZ. Family studies have found a modestly elevated risk of SZ in relatives of cases with mood disorders and MICP,13, 14 and vice versa.15 More recently, two genome-wide linkage scans of BP with MICP have been performed;9, 16 while there was limited overlap between the two, a number of genome-wide suggestive signals emerged in regions previously implicated in SZ linkage studies, including on chromosomes 1p32, 2q12 and 13q31.

We hypothesize that the BP subtype with MICP features may help uncover susceptibility genes for BP not readily detectable by examining the broad BP phenotype. Given the clinical similarities between MICP symptoms in BP and in SZ, we also hypothesize that some of these susceptibility genes would show partial overlap with susceptibility genes for SZ. To test these hypotheses, we have performed a GWAS analyses of the MICP phenotype using several large BP data sets and further sought to determine whether our findings might also influence susceptibility to SZ, by making use of the recently published Psychiatric GWAS Consortium (PGC) SZ results.

Materials and methods

Subjects

Subjects with BP and MICP were initially drawn from three large, independent samples of BP, here termed the National Institute of Mental Health Bipolar Disorder Consortium (NIMH), the Wellcome Trust Case–Control Consortium (WTCCC) and the German samples. Subject collection and genotyping were conducted separately for each sample. However, quality control steps taken in the analyses of these samples have been identical.

NIMH sample

Eligibility, ascertainment and assessment procedures for the NIMH sample have been previously described.17 All subjects were assessed with the Diagnostic Interview for Genetic Studies (DIGS), and this was combined with family informant data and medical records to assign diagnoses based on DSM-III-R or DSM-IV criteria. Controls were ascertained throughout the United States and were genotyped as part of the Molecular Genetics of Schizophrenia II (MGS2) Collaboration.18 All control subjects completed a psychiatric questionnaire and those endorsing a history of BP, psychosis or symptoms sufficient for a diagnosis of major depression were excluded. Controls were matched to cases according to ethnicity and sex.

Wellcome Trust Case–Control Consortium

The WTCCC sample comprised the original 1868 participants with BP and 2938 controls from the WTCCC-1 study, and an additional 2725-independent controls from the WTCCC-2 study. Clinical assessment included a semi-structured interview and review of case notes. Ratings of symptom occurrence and course of illness were made using the Bipolar Affective Disorder Dimension Scale19 and/or the operational criteria (OPCRIT) item check list.20 Diagnoses were based on all available data using the Research Diagnostic Criteria.21 Both WTCCC-1 and WTCCC-2 controls were evenly ascertained from the 1958 British Birth Cohort and the UK Blood Service Control Group samples.

German sample

The collection and genotyping of the German BP sample has been recently described.22 In brief, probands with bipolar I disorder were recruited through consecutive hospital admissions and assessed using a structured interview and the OPCRIT checklist. Best- estimate diagnoses were assigned according to DSM-IV criteria. A population-based control sample screened for psychiatric disorders was assembled from the PopGen, KORA and Heinz Nixdorf Recall studies.

Mood-incongruent psychosis phenotype

Considerable attention was given to ensuring comparability between the phenotypes across the three samples. While the NIMH, WTCCC and German samples used different diagnostic instruments (that is, DIGS, Bipolar Affective Disorder Dimension Scale, OPCRIT) to diagnose participants, the psychotic symptoms queried by these interviews are quite similar and, in many cases, identical. In all studies, mood-incongruence was defined according to the DSM-IV definition as psychotic ‘content…inconsistent with depressive themes such as guilt, illness, personal inadequacy or catastrophe... [or] inconsistent with manic themes such as inflated worth, power, knowledge, identity, or special relationship to a deity or famous person.’ Further DSM-IV mood-incongruent psychotic symptoms include ‘delusions of thought insertion…of thought broadcasting, and…of control’.

For the NIMH samples, the designation of psychotic features was based on a lifetime history of auditory or visual hallucinations or delusions. In the DIGS subjects are asked about the presence of psychotic symptoms during their most severe depression, as well as during a subsequent review of lifetime psychotic symptoms. Interviewers were asked to determine the incongruence of psychotic content following descriptions of hallucinations or delusions in a mood episode. At least one positive response to these questions in either depressive or manic episodes rendered a subject positive for a lifetime history of mood-incongruence. If insufficient information was available to rule in or rule out a lifetime presence of mood-incongruent psychosis, the case was designated as indeterminate. In the genotyped NIMH sample of 2191 BP cases, 1372 (62.6%) were classified as having a lifetime history of psychosis and 960 (43.8%) met criteria for a lifetime history of mood-incongruent psychosis.

The WTCCC study used a semi-structured diagnostic instrument similar to the DIGS, which was then used to complete the Bipolar Affective Disorder Dimension Scale and OPCRIT rating scales. The designation of mood-incongruence was made if a case had a rating of >20 in the Bipolar Affective Disorder Dimension Scale mood-incongruent dimension or if the OPCRIT ratings (questions 54, 58, 59, 61–63, 66–68, 72–77) were positive for any of the DSM-IV mood-incongruent items. Out of the 1868 cases from the WTCCC-1 study, 873 (46.7%) were classified as having had MICP.

In the German study, the designation of mood-incongruence was made using the OPCRIT items that reflected the DSM-IV definition. Of the 645 cases with BPI, 363 (56.3%) were classified as having had mood-incongruent psychosis.

Genotyping

National Institute of Mental Health Bipolar Disorder Consortium. Unrelated cases and controls were genotyped in two separate stages by the Bipolar Genome Study. In all, 1001 cases and 1034 controls selected for the Genetic Association Information Network study were genotyped by the Broad Institute using the Affymetrix 6.0 platform (Santa Clara, CA, USA), while an additional 1190 cases and 401 controls were genotyped by the Translational Genomics Research Institute, also on the Affymetrix 6.0 platform. We included the final cleaned data set from the primary case–control analysis of each sample. The Genetic Association Information Network-BP sample included genotype data on 724 067 single-nucleotide polymorphisms (SNPs). The Translational Genomics Research Institute data set included genotype data on 728 187 SNPs.

Wellcome Trust Case–Control Consortium. Genotyping was conducted in two stages, with cases and controls available from WTCCC-1 and controls only from WTCCC-2. The combined WTCCC-1 and 2 samples consisted of 1868 cases and 5682 controls (2957 from WTCCC-1 and 2725 from WTCCC-2). There were 382 370 SNPs generated using Affymetrix 5.0 from stage 1 and 696 889 SNPs generated from the Affymetrix 6.0 platform in stage 2.

German sample. The genotyping of the German BP sample has been recently described.22 Genotyping was performed on the Illumina HumanHap550 array (Illumina Inc, San Diego, CA, USA). We utilized the final cleaned data set, which included 645 cases and 1310 controls with genotype data on 516 024 SNPs.

Data management

Quality control. Each data set underwent identical quality control steps. We dropped subjects with a missing data rate3%, and SNPs with a missing data rate5%. SNPs were also dropped if their frequency was<1% or if they had a Hardy–Weinberg equilibrium P<0.001. The quality control steps were applied to each study individually before and after imputation, as well as to the combined mega-analysis sample. To assess for cryptic relatedness, genotyped SNPs for each study were combined and identity by descent analysis was performed using a linkage disequilibrium-pruned data set. We removed one of any pair of individuals with identity by descent score sharing>0.2.

Principal component analysis. To account for population stratification in the GWAS sample, we performed principal component analysis within each data set using markers in linkage equilibrium (R2<0.2) with a minor allele frequency >5%. We also excluded all AT/GC SNPs and all SNPs within the major histocompatibility complex (chr6: 25–23 Mb) and chromosome 8 inversion (chr8: 7–13 Mb) regions.

Principal components were selected based on visualization of a screen plot (graphs of the principal components are shown in Supplementary Figure 1). Plotting of the principal components showed two, 11, and one clear outlier(s) in the NIMH, WTCCC and German data sets, respectively, that were removed.

Imputation. Imputation was performed to facilitate a meta-analysis of the three data sets, which were originally genotyped on Affymetrix 5.0 (WTCCC-1), Affymetrix 6.0 (all NIMH and the WTCCC-2 control samples) and Illumina HumanHap550v3 microarray chips. We used BEAGLE to flip orientation to the positive strand and to impute allelic dosages for autosomal SNPs in the cases and controls. Each data set was imputed separately using phased haplotype data from HapMap I & II release 24 (http://hapmap.ncbi.nlm.nih.gov/) as the reference panel. The WTCCC, NIMH, and German samples were imputed separately and subsequently combined. Quality control filters were applied to the individual studies and to the combined data set. SNPs were removed according to the following criteria: minor allele frequency<0.01, R2<0.3, Hardy-Weinberg equilibrium P<0.001 and a PLINK INFO score<0.8. The final imputed files contained 2 488 722 (NIMH), 2 404 996 (WTCCC) and 2 472 086 (German) markers.

Data analysis

Association analysis. We used the imputed allelic data from the NIMH, WTCCC and German data sets and performed test of association in logistic regression analysis for each dataset, comparing cases with MICP versus controls. Tests of association were performed in PLINK using allelic dosages in an additive logistic regression model including covariates for the top principal components. As a measure of potential residual population stratification, we used WGAviewer (Duke University, Durham, NC, USA) to calculate lambda (λ) as the ratio of the median observed versus expected χ2 statistic. To facilitate comparison with other samples, we also calculated a standardized lambda (λ1000) for a sample of 1000 cases and 1000 controls.

Case-only analysis. We performed a case-only analysis of our top findings. Principal components and association studies were performed in each individual sample using all cases with MICP versus all other cases without MICP. As several variables can lead to the designation of MICP, we included as negative only those subjects without any missing data for those variables.

Meta-analysis. Association analysis of each data set was performed with data set-specific principal components. Meta-analysis was subsequently performed in PLINK under a fixed effect model for both the case–control and case-only analysis.

Polygenic analysis. Using the PGC SZ results as a training set, we performed a polygenic analysis of each of the three samples in cases with and without MICP. The PGC SZ results were clumped into a data set of 85 284 markers in linkage equilibrium (r2<0.25) using the CEU HapMapII samples as a reference. We used PLINK to calculate polygenic scores using the SZ risk alleles weighted by their log(OR) as the initial training set sample. Polygenic scores were derived in each individual MICP samples using several P-value thresholds. A case-only logistic regression was subsequently performed for each data set using the polygenic scores as the dependent variable and each study’s principal components as covariates. Fixed effect meta-analysis of each association was subsequently performed in STATA (Stata Corp, College Station, TX, USA) using its ‘meta’ function. As a majority of the controls in the MICP analysis (5022 out of 8148) overlapped with the controls in the PGC schizophrenia (SZ) analysis, we restricted our MICP analysis to a case-only analysis.

Results

We first performed association analyses of each individual data set. The number of cases with mood-incongruent psychosis across the three data sets was 960 (43.8%) in the NIMH, 873 (46.7%) in the WTCCC and 363 (56.3%) in the German samples. Quantile-quantile (QQ) plots for each sample are shown in Supplementary Figure 1. The λ estimates were 1.01, 1.04 and 1.02 for the NIMH, WTCCC and German samples, respectively. There were no genome-wide significant findings within each sample, leading us to proceed to a fixed effect meta-analysis of the three data sets. The meta-analysis included a total of 2196 cases with BP with MICP and 8418 controls.

As shown in Figure 1 and Table 1, the most highly associated marker in the meta-analysis was rs1171113, with the C allele being overrepresented in cases (odds ratio (OR)=1.23, P=9.67 × 10–8). Figure 2a highlights this association in more detail, showing that rs1171113 tags an 150–200 kb region of chromosome 6q14.2 that encompasses the genes PRSS35 (protease, serine 35) and SNAP91 (synaptosomal-associated protein, 91 kDa homolog, also known as clathrin coat assembly protein AP180). The second strongest association was found on chromosome 3p22.2 with the rs9834970 marker (risk allele=T; OR=0.82; P=9.71 × 10–8). As shown in Figure 2b, this marker is 15 kb downstream of TRANK1 (tetratricopeptide repeat and ankyrin repeat containing 1 gene), also known as LBA1 (lupus brain antigen 1). Finally, Figure 2c shows the third strongest finding with a P<10−6, on chromosome 14q24.2 (rs2333194; P=7.0 × 10–7), which is in the third intron of NUMB. To determine whether these top findings were specifically associated with MICP, we performed a logistic regression analysis comparing BP cases with MICP versus all other BP cases. We found evidence of association of the risk alleles with MICP cases in two of our top findings: rs1171113 (OR=1.14, P=0.014) and rs2333194 (OR=1.12, P=0.02), but not in rs9834970 (OR=1.04, P=0.42).

Figure 1
figure 1

Manhattan plot of the meta-analysis comparing cases with bipolar disorder with mood-incongruent psychotic features to normal controls.

Table 1 Meta-analysis results for the comparison of bipolar disorder cases with mood-incongruent psychotic features to normal controls, showing markers associated with a P<1 × 10–5
Figure 2
figure 2

Regional and forest plots of the top three findings in the case–control meta-analysis. Results are shown for: (a) PRSS35/SNAP91, (b) TRANK1/LBA1 and (c) NUMB. Genome coordinates are based on NCBI 36/hg 18 build and recombination rates are derived from the HapMapII CEU sample.

To test the hypothesis that the MICP subset of BP may be associated with SZ, we performed a polygenic analysis of the MICP data set using a case-only design, as most of the controls in this study overlap with those of the PGC SZ study. First, we selected common markers between the PGC SZ and our MICP data sets and subsequently performed linkage disequilibrium-pruning to obtain markers in approximate linkage disequilibrium (r2<0.25). We derived a polygenic score from the PGC SZ analysis using several association P-value thresholds, ranging from 0.05–0.5, and performed a logistic regression analysis in each case-only sample, testing whether an individual’s polygenic score was associated with the presence or absence of MICP. Each sample was corrected for by its study-specific principal components. In a meta-analysis of the three samples, we found significant evidence for polygenic overrepresentation of SZ risk alleles in BP cases with MICP (Table 2), with the strongest evidence of association found at the P<0.2 threshold (Pmeta=0.003). Evidence of association was found across all P-value thresholds, suggesting that this association was unlikely to be spurious.

Table 2 Association of PGC schizophrenia polygenic scores with mood-incongruent psychosis in cases with bipolar disorder

Discussion

In this study we have combined three large BP GWAS samples to test the hypotheses that (1) a subphenotype of BP, marked by MICP, would show association in loci not found in analyses of the more broadly defined phenotype; and (2) that the MICP BP subphenotype would show evidence at a broader level for a relationship with SZ. Our meta-analysis of 2196 cases and 8418 controls found association of several markers with P<1 × 10–6, though none reached genome-wide significance. Prior family studies suggested that mood disorders with MICP might be etiologically related to SZ.14, 23 In support of this hypothesis, our polygenic analysis using risk alleles from the PGC SZ GWAS study found an association, en masse, between large numbers of SZ variants and the presence of MICP in BP cases, suggesting that genetic overlap across syndromes is potentially due to shared polygenic variation with individual effects that are likely too small to be detected in single marker GWAS analyses.

The marker with the strongest association (rs1171113), which was enriched in the case-only analysis, resides in a region of high linkage disequilibrium that includes the genes PRSS35 and SNAP91. The PRSS35 gene encodes an inactive serine protease homolog of no known function (reference uniprot database24). SNAP91 codes for a 91 kDa synaptosomal-associated protein, also known as clathrin assembly protein 180 (AP180), which is enriched in the presynaptic terminal and expressed only in neurons.25 SNAP91 is essential for the formation and function of clathrin coated vesicles, which are the major means of recycling vesicles at the presynaptic membrane.26 Interestingly, SNAP91 has also been proposed to have a role in calcium signaling and the Wnt pathway. Studies have found that the SNAP91 protein can inhibit two important intracellular signaling molecules, phospholipase C-γ1 and phospholipase D1.27, 28 Both molecules have been implicated in the Wnt pathway29 and phospholipase C-γ1 has a pivotal role in intracellular inositol signaling, which is hypothesized to have a role in lithium’s mechanism of action. As some studies have suggested that BP with mood-incongruent psychosis is less responsive to lithium treatment,12, 30 it is intriguing to speculate that SNAP91, via its effect on phospholipase C-γ1, may be involved in lithium response.

Our second strongest association signal (rs9834970) is ≈12 kb downstream of the LBA1 gene, which is also known as TRANK1 (tetratricopeptide repeat and ankyrin repeat containing 1), and was originally identified as a brain-specific antigen in a murine model of systemic lupus erythematosus, a human disorder with frequent neuropsychiatric symptoms.31 This marker was recently found to have genome-wide significant evidence for association with BP in a large meta-analysis of mixed ancestry samples.32 Notably, this marker also showed suggestive evidence for association with SZ in the PGC SZ meta-analysis (p=3.6 × 10–5).33 Our association with this marker was found with far smaller sample sizes; however, we did not find evidence that the rs9834970 risk allele was specifically associated with MICP in a case-only analysis, suggesting that this marker is likely to be representative of a risk allele for the broader diagnosis of BP. Our third strongest association (rs2333194) did show modest overrepresentation in MICP in our case-only analysis and was found in NUMB, an important modulator of the notch signaling pathway, which among other functions, has been shown to regulate neuronal differentiation in vitro.34

Psychiatric phenotypes are likely amalgams of multiple genetic, development and environmental influences. Although single gene findings can illuminate important pathophysiological mechanisms, phenotypic considerations are likely to be more correlated with markers of aggregate genetic effects, such as familiality or ‘global’ genetic analysis like polygenic risk profiling.35 In this study such a polygenic analysis has found evidence for association of the MICP BP phenotype with SZ, which is consistent with prior family studies, and provides further support for etiological ties between the major mood and psychotic disorders. A prior study found a similar association between SZ and a phenotype that is closely related to BP with MICP, that of schizoaffective disorder as diagnosed by the Research Diagnostic Criteria.36 Although cases with RDC schizoaffective disorder are included within our diagnosis of BP with MICP, we were not able to derive this RDC-based diagnosis from the DSM-based DIGS interview.

The results of this study should be interpreted in light of several limitations. Perhaps most important is the limited power to detect risk variants of small effect sizes, particularly as loss of statistical power is inherent in the subphenotype approach, which splits a broader case sample into smaller units. Even with the three large BP samples, our study (assuming a common and fully informative marker allele) only had sufficient power to detect ORs in the ≈1.3 range, which is slightly greater than that of our top finding (OR=1.23). Although larger samples will provide increased statistical power, they will also create the challenge that arises from the need to harmonize subphenotype definitions. As patients with psychosis may fail to recall specific psychotic symptoms,37 differential misclassification of mood-incongruence is an important concern, particularly in case-only analysis, which prompted us to focus our primary analysis on a case–control design.

In summary, we have performed a large GWAS of the MICP subphenotype and found no associations meeting genome-wide significance criteria, although several associations at the ‘suggestive’ level warrant further investigation. More globally, our polygenic analysis has found modest evidence for association of the MICP BP phenotype with SZ, which is consistent with prior family studies, and provides further support for etiological ties between the major mood and psychotic disorders.