Genome-wide analysis in UK Biobank identifies four loci associated with mood instability and genetic correlation with major depressive disorder, anxiety disorder and schizophrenia

Mood instability is a core clinical feature of affective and psychotic disorders. In keeping with the Research Domain Criteria approach, it may be a useful construct for identifying biology that cuts across psychiatric categories. We aimed to investigate the biological validity of a simple measure of mood instability and evaluate its genetic relationship with several psychiatric disorders, including major depressive disorder (MDD), bipolar disorder (BD), schizophrenia, attention deficit hyperactivity disorder (ADHD), anxiety disorder and post-traumatic stress disorder (PTSD). We conducted a genome-wide association study (GWAS) of mood instability in 53,525 cases and 60,443 controls from UK Biobank, identifying four independently associated loci (on chromosomes 8, 9, 14 and 18), and a common single-nucleotide polymorphism (SNP)-based heritability estimate of ~8%. We found a strong genetic correlation between mood instability and MDD (r g = 0.60, SE = 0.07, p = 8.95 × 10−17) and a small but significant genetic correlation with both schizophrenia (r g = 0.11, SE = 0.04, p = 0.01) and anxiety disorders (r g = 0.28, SE = 0.14, p = 0.04), although no genetic correlation with BD, ADHD or PTSD was observed. Several genes at the associated loci may have a role in mood instability, including the DCC netrin 1 receptor (DCC) gene, eukaryotic translation initiation factor 2B subunit beta (eIF2B2), placental growth factor (PGF) and protein tyrosine phosphatase, receptor type D (PTPRD). Strengths of this study include the very large sample size, but our measure of mood instability may be limited by the use of a single question. Overall, this work suggests a polygenic basis for mood instability. This simple measure can be obtained in very large samples; our findings suggest that doing so may offer the opportunity to illuminate the fundamental biology of mood regulation.


Introduction
Mood instability is a common clinical feature of affective and psychotic disorders, particularly major depressive disorder (MDD), bipolar disorder (BD) and schizophrenia 1 . It may also be relatively common in the general population, estimated to affect~13% of individuals 2 . As a dimensional psychopathological trait, it is potentially a useful construct in line with the Research Domain Criteria approach 3 . Mood instability may be of fundamental importance for understanding the pathophysiology of MDD and BD, as well as conditions such as borderline personality disorder, anxiety disorders, attention deficit hyperactivity disorder (ADHD) and psychosis 4 . This trait is reported by 40-60% of individuals with MDD 5 and is recognised as part of the prodromal stage of BD 6 . In established BD, it is a clinical feature that independently predicts poor functional outcome 7 . Furthermore, general population twin studies suggest that additive genetic effects account for 40% of the variance in measures of affect intensity and 25% of the variance in affective liability 8 .
Population-based studies such as the Adult Psychiatric Morbidity Survey (APMS) have defined mood instability based on responses to a single question, while clinical studies have made use of more detailed rating scales 4 . However, there is a lack of consensus about how best to measure and classify mood instability, and none of the currently available instruments adequately capture intensity, speed and frequency of affective change, or physiological and behavioural correlates. A recent systematic review proposed that mood instability be defined as 'rapid oscillations of intense affect, with a difficulty in regulating these oscillations or their behavioural consequences' 9 . Applying this definition will require the future development and validation of a multidimensional assessment of mood instability, which is currently not available.
Within the UK Biobank population cohort of over 0.5 million individuals 10 , the baseline assessment interview contained a question of relevance to mood instability, specifically: 'Does your mood often goes up and down?' This is similar to the question for mood instability used within the APMS ('Do you have a lot of sudden mood changes, suffered over the last several years'). Hypothesising that this simple question taps into pathological mood instability, we predicted that it would be more commonly endorsed by individuals within UK Biobank with MDD and BD, compared to individuals with no psychiatric disorder. Moreover, under the hypothesis that this trait has cross-disorder pathophysiological relevance, we predicted that a genome-wide association study (GWAS) might identify shared genetic liability to mood instability and risk for psychiatric disorders in which disordered mood is a feature, including MDD, BD, schizophrenia, ADHD, anxiety disorder and post-traumatic stress disorder (PTSD). Given the size of the sample, we also aimed to identify loci associated with this measure of mood instability.

Materials and methods
Sample UK Biobank is a large cohort of more than 502,000 United Kingdom residents, aged between 40 and 69 years 10 . The aim of UK Biobank is to study the genetic, environmental and lifestyle factors that cause or prevent disease in middle and older age. Baseline assessments occurred over a 4-year period, from 2006 to 2010, across 22 United Kingdom (UK) centres. These assessments were comprehensive and included social, cognitive, lifestyle and physical health measures. For the present study, we used the first genetic data release based on approximately one-third of UK Biobank participants. Aiming to maximise homogeneity, we restricted the sample to those who reported being of white UK ancestry (around 95% of the sample).
UK Biobank obtained informed consent from all participants, and this study was conducted under generic approval from the NHS National Research Ethics Service (approval letter dated 13 May 2016, Ref 16/NW/0274) and under UK Biobank approvals for application #6553 'Genome-wide association studies of mental health' (PI Daniel Smith).

Mood instability phenotype
As part of the baseline assessment, UK Biobank participants completed the 12 items of the neuroticism scale from the Eysenck Personality Questionnaire-Revised Short Form (EPQ-R-S) 11 . One of these items assesses mood instability, namely 'Does your mood often goes up and down?' Participants responding 'yes' to this question were considered to be cases of mood instability and those responding 'no' were considered controls. From the control sample, we excluded those who reported being on psychotropic medication, and those who reported a physician diagnosis of psychiatric disorder (including MDD, BD, anxiety/panic attacks, 'nervous breakdown', schizophrenia and deliberate self-harm/suicide attempt).
After quality-control steps (detailed below) and exclusions (3679 participants responded 'don't know' and 211 responded 'prefer not to say'), the final sample for genetic analysis comprised 53,525 cases of mood instability and 60,443 controls. Mood instability cases were younger than controls (mean age 55.8 years (SD = 8.05) vs. 57.7 years (SD = 7.74); p < 0.0001) and had a greater proportion of females (55.5% vs. 49.6%; p < 0.0001).

Genotyping and imputation
In June 2015, UK Biobank released the first set of genotypic data for 152,729 UK Biobank participants. Approximately 67% of this sample was genotyped using the Affymetrix UK Biobank Axiom array (Santa Clara, CA, USA) and the remaining 33% were genotyped using the Affymetrix UK BiLEVE Axiom array. These arrays have over 95% content in common. Only autosomal data were available under the data release. Data were preimputed by UK Biobank as fully described in the UK Biobank interim release documentation 12 . Briefly, after removing genotyped single-nucleotide polymorphisms (SNPs) that were outliers, or were multiallelic or of low frequency (minor allele frequency (MAF) < 1%), phasing was performed using a modified version of SHAPEIT2 and imputation was carried out using IMPUTE2 algorithms, as implemented in a C++ platform for computational efficiency 13,14 . Imputation was based upon a merged reference panel of 87,696,888 biallelic variants on 12,570 haplotypes constituted from the 1000 Genomes Phase 3 and UK10K haplotype panels 15 . Variants with MAF < 0.001% were excluded from the imputed marker set. Stringent quality control before release was applied by the Wellcome Trust Centre for Human Genetics, as described in UK Biobank documentation 16 .

Statistical analyses Quality control and association analyses
Before all analyses, further quality-control measures were applied. Individuals were removed based on UK Biobank genomic analysis exclusions (Biobank Data Dictionary item #22010), relatedness (#22012: genetic relatedness factor; a random member of each set of individuals with KING-estimated kinship coefficient >0.0442 was removed), gender mismatch (#22001: genetic sex), ancestry (#22006: ethnic grouping; principal component (PC) analysis identified probable Caucasians within those individuals who were self-identified as British and other individuals were removed from the analysis), and qualitycontrol failure in the UK BiLEVE study (#22050: UK BiLEVE Affymetrix quality control for samples and #22051: UK BiLEVE genotype quality control for samples). A sample of 113,968 individuals remained for further analyses. Of these, 53,525 were classed as cases and 60,443 were classified as controls. Genotype data were further filtered by removal of SNPs with Hardy-Weinberg equilibrium P < 10 −6 , with MAF < 0.01, with imputation quality score <0.4 and with data on <90% of the sample after excluding genotype calls made with <90% posterior probability, after which 8,797,848 variants were retained.
Association analysis was conducted in PLINK 17 using logistic regression under a model of additive allelic effects with sex, age, genotyping array and the first eight PCs (Biobank Data Dictionary items #22009.01 to #22009.08) as covariates. Sex and age were included as covariates because cases and controls differed significantly on these measures. Genetic PCs were included to control for hidden population structure within the sample, and the first eight PCs, out of 15 available in the Biobank, were selected after visual inspection of each pair of PCs, taking forward only those that resulted in multiple clusters of individuals after excluding individuals self-reporting as being of non-white British ancestry (Biobank Data Dictionary item #22006). Overall, population structure had little impact on mood instability status. The threshold for genome-wide significance was p < 5.0 × 10 −8 .

Heritability and genetic correlation between mood instability and psychiatric phenotypes
We applied Linkage Disequilibrium Score Regression (LDSR) 18 to the GWAS summary statistics to estimate SNP heritability (h 2 SNP ). Genetic correlations between mood instability and MDD, BD, schizophrenia, ADHD, anxiety disorder and PTSD were also evaluated using LDSR 19 (with unconstrained intercept), a process that corrects for potential sample overlap without relying on the availability of individual genotypes 18 . For the MDD, BD, schizophrenia, ADHD, anxiety disorder and PTSD phenotypes, we used GWAS summary statistics provided by the Psychiatric Genomics Consortium (http://www. med.unc.edu/pgc/) [20][21][22][23][24][25] . Note that for the purposes of these genetic correlation analyses we re-ran the GWAS of mood instability excluding from the cases those 9865 participants who reported being on psychotropic medication, or who self-reported psychiatric disorder (MDD, BD, anxiety/panic attacks, 'nervous breakdown', schizophrenia and deliberate self-harm/suicide attempt). This secondary GWAS output (rather than the primary GWAS reported below) was used for the genetic correlation calculations and for polygenic risk score (PRS) analyses, the rationale being that this was a more conservative approach that would avoid genetic correlations between mood instability and MDD/BD/schizophrenia/ADHD/ anxiety disorders/PTSD being driven by a subset of individuals with psychiatric disorder.

PRS analysis of MDD, BD and schizophrenia as predictors of mood instability
PRSs were created using the output of the PCG MDD 29 of 32 cohort GWAS (supplied by the MDD working group of the PGC, http://www.med.unc.edu/pgc/pgcworkgroups), BD GWAS 20 and schizophrenia GWAS 21 . Five PRS were created for each psychiatric phenotype using p value cutoffs of p < 5 × 10 −8 , p < 0.01, p < 0.05, p < 0.1 and p < 0.5, with the exception of MDD for which there were no genome-wide significant SNPs. Ambiguous SNPs, indels (insertion/deletion mutations) and SNPs with an imputation quality score of less than 0.8 were removed. LD clumping was performed via PLINK on a random sample of 10,000 individuals using an r 2 > 0.05 in a 250 kb window. SNPs were clumped into sets and filtered, selecting the SNP with the lowest p value from each set. In the event that two or more SNPs from a set had the same p value, the SNP with the largest beta coefficient was used. PLINK was also used to calculate the PRS to produce a per-allele weighted score with no mean imputation.

PRS modelling
Only those subjects who were used for the genetic correlation analyses were used in the PRS analyses (that is, PRS analyses also excluded from both case and control groups those individuals in UK Biobank with psychiatric disorder). Modelling was performed in R (version 3.1.2) using the glm function. Full sample and age-stratified analysis models were adjusted for age, sex, chip and PGCs 1-8, whereas sex-stratified analysis was not adjusted for sex. Scores were then split into deciles using the ntile function of the dplyr package. Model Nagelkerke r 2 was calculated using the fmsb package.

Mood instability in MDD and BD within UK Biobank
In previous work we have identified individuals within UK Biobank with a probable diagnosis of mood disorder, including cases of MDD (subdivided into single-episode MDD, recurrent moderate MDD and recurrent severe MDD) and BD, as well as non-mood disordered controls 26 . These classifications were independent of response to the mood instability question or other questions from the EPQ-R-S. For the group of participants who could be classified in this way, we assessed the proportion with mood instability within each mood disorder category. All mood disorder groups had a significantly greater proportion of individuals with mood instability compared with the control group (Table 1), in which the prevalence was 35.3%. This proportion was highest in the BD group (74.0%) followed by the three MDD groups (71.7% for recurrent severe MDD, 64.2% for recurrent moderate MDD and 43.7% for single-episode MDD). There were too few UK Biobank participants with a reliable classification of schizophrenia, ADHD, anxiety disorder or PTSD to allow for an assessment of the prevalence of mood instability in these groups.

GWAS of mood instability
The mood instability GWAS results are summarised in Fig. 1 (Manhattan plot), Fig. 2 (QQ plot) and Table 2 (genome-wide significant loci associated with mood instability). Regional plots are provided in Figs. 3a-d.
Overall, the GWAS data showed modest deviation in the test statistics compared with the null (λ GC = 1.13); this was negligible in the context of sample size (λ GC 1000 = 1.002). LDSR suggested that deviation from the null was due to a polygenic architecture in which h 2 SNP accounted for~8% of the population variance in mood instability (observed scale h 2 SNP = 0.077 (SE 0.007)), rather than inflation due to unconstrained population structure (LD regression intercept = 0.998 (SE 0.009)).
We observed four independent genomic loci exhibiting genome-wide significant associations with mood instability (Fig. 1, Table 2    SNPs across all loci. Given the functional alleles that drive association signals in GWAS may not affect the nearest gene, we use the above gene names to provide a guide to location rather than to imply that altered function or expression of those genes are the sources of the association signals. We also repeated this GWAS for males and females separately (Supplementary Figs. S1 and S2) and for the sample stratified according to median age (age 58 and below, and age 59 and above; Supplementary Figs. S3 and S4). No genome-wide significant loci were observed from these stratified analyses, possibly because of reduced power, apart from the retention of a single genome-wide significant finding at rs8084280 on chromosome 18 (the DCC gene) for males only (Supplementary Fig. S1). There was a high degree of genetic correlation between mood instability in males and females (r g = 1.02, SE = 0.09, p = 2.84 × 10 −30 ), and between mood instability in the younger and older subgroups (r g = 1.02, SE = 0.09, p = 2.67 × 10 −27 ).
Within supplementary materials, we also present the results of the secondary GWAS of mood instability that was excluded from the case group of 9865 participants with a psychiatric disorder (Supplementary Table S1). This GWAS was used to assess for genetic correlation between mood instability and MDD, BD, schizophrenia, ADHD, anxiety disorders and PTSD, and for the PRS analyses. Supplementary Table S1 shows that the risk allele frequencies (RAFs) of the index SNPs within the four genome-wide significant loci from the primary GWAS were very similar to the RAFs for these same SNPs within this secondary GWAS: for rs7829975 the RAF was 0.516 vs. 0.523; for rs10959826 it was 0.785 vs. 0.789; for rs397852991 it was 0.606 vs. 0.673; and for rs8084280 it was 0.508 vs. 0.514). However, it should be noted that, perhaps due to a loss of power from excluding 9865 individuals, only one of these four loci retained genome-wide significance (rs7829975 on chromosome 8).

PRS analysis of MDD, BD and schizophrenia as predictors of mood instability
Using the PRS approach, both MDD and schizophrenia had significant positive correlations with mood instability status (for MDD at p < 0.5 PRS threshold: OR = 1.029, 95% CI = 1.02-1.033, r 2 = 0.023, p = 1.00 × 10 −34 and for schizophrenia at p < 0.1 PRS threshold: OR = 1.009, 95% CI = 1.005-1.014, r 2 = 0.021, p = 6.71 × 10 −5 ; Supplementary Table S2). There was no evidence of an association between PRS for BD and mood instability. This finding of a positive correlation between PRSs for MDD and schizophrenia and mood instability status (and no such correlation for BD PRS) was consistent across additional analyses stratified for sex and age (Supplementary Tables S3-S6).

Discussion
We have identified four independent loci associated with mood instability within a large population cohort, in what is to date the only GWAS of this phenotype. We also identified a SNP-based heritability estimate for mood instability of~8%, and a strong genetic correlation between mood instability and MDD, suggesting substantial genetic overlap between mood instability and vulnerability to MDD. There was also a small but significant genetic correlation between mood instability and schizophrenia and between mood instability and anxiety disorders, but no significant genetic correlation with BD, ADHD or PTSD. PRS analyses found a positive correlation between genes for both MDD and schizophrenia and mood instability status, but this was not the case for BD.
The strong genetic correlation between mood instability and MDD is of interest because it is consistent with the hypothesis that at least part of the pathophysiology of MDD might include a reduced capacity to effectively regulate affective states. In support of this is evidence that individuals with MDD tend to have maladaptive responses to intense emotions, responding with worry, rumination and self-criticism, which can then exacerbate negative emotional states 27 . This maladaptive pattern of responses is also consistent with our finding of a small but significant genetic correlation between mood instability and both anxiety disorder and schizophrenia.
The lack of genetic correlation between mood instability and BD was unexpected, given that mood instability is considered a core deficit in BD 4 and was more common in our BD cases than MDD cases. Similarly, a genetic correlation between mood instability, ADHD and PTSD might have been anticipated. This lack of correlation between mood instability and BD/ADHD/PTSD is difficult to account for, but might be explained by the relatively underpowered nature of the BD, ADHD and PTSD GWAS analyses, compared to the analyses used for MDD and schizophrenia. It is worth noting that, although not significant, the magnitude of the genetic correlation between mood instability and ADHD was 0.14. Similarly, the genetic correlation between mood instability and PTSD was not significant but had a magnitude of 0.33.
It is well documented that MDD occurs more commonly in females than in males, and it is possible that mood instability may be of greater relevance as a crosscutting phenotype for women compared to men. We therefore carried out a GWAS of mood instability for males and females separately ( Supplementary Fig. S1 and Fig. S2). These stratified analyses found no genome-wide significant loci for females and only one genome-wide significant locus for males (the previously identified locus on chromosome 18). Furthermore, there was perfect genetic correlation between mood instability in males and females. Although these analyses had reduced power, they suggest that there was no evidence for a large number of sex-specific loci for mood instability. Similarly, we carried out GWAS stratified by age, for those in the sample at or below the median age of 58 and for those above age 58 (Supplementary Figs. S3 and S4). As with stratification by sex, these age-stratified analyses did not identify any genome-wide significant loci, and there was perfect correlation between mood instability in the younger and older subgroups.
It is not possible to be certain which of the genes within associated loci are likely to be most relevant to the pathophysiology of mood instability but several genes of interest were identified. For example, the lead SNP within the associated region on chromosome 18 lies in intron 9 of the DCC netrin 1 receptor (originally named deleted in colorectal cancer; DCC) gene, with no other proteincoding genes for >500 kb on either side (Fig. 3d). DCC is the receptor for the guidance cue netrin 1, which has a central role in the development of the nervous system, including (but not limited to) the organisation and function of mesocorticolimbic dopamine systems 28 . Recent studies have shown a range of human phenotypes associated with loss-of-function mutations in DCC, including agenesis of the corpus callosum, learning disabilities and mirror movements, all associated with a large-scale disruption of the development of commissural connectivity and lateralisation 29,30 . Manitt et al. have identified that DCC has a role in regulating the connectivity of the medial prefrontal cortex during adolescence and found that DCC expression was elevated in the brain tissue of antidepressant-free subjects who committed suicide 31 . This suggests a possible role for DCC variants in increasing predisposition to mood instability and mood disorders, as well as related psychopathological phenotypes.
The associated region on chromosome 14 contains at least 10 candidate genes ( Table 2 and Fig. 3c). One of these is translation initiation factor 2B subunit beta (EIF2B2), mutations in which are known to cause a range of clinically heterogeneous leukodystrophies 32 . Reduced white matter integrity has been consistently associated with negative emotionality traits (such as harm avoidance, neuroticism and trait anxiety) 33 , as well as with MDD and BD 34 . It is therefore possible that variation in EIF2B2 may have a role in mood instability.
Another gene within the associated region on chromosome 14 is placental growth factor (PGF), a member of the angiogenic vascular endothelial growth factor (VEGF) family 35,36 , which is expressed at high levels in the placenta and thyroid 37 . PGF has a wide range of functions, including embryonic thyroid development 38 and immune system function 39,40 , as well as a role in atherosclerosis, angiogenesis in cancer, cutaneous delayed-type hypersensitivity, obesity, rheumatoid arthritis and preeclampsia 39,[41][42][43][44] . PGF may be of interest because of the long-established association between thyroid dysfunction and both MDD and BD 45 , along with the recent observation that pre-eclampsia may be a marker for the subsequent development of mood disorders 46 .
Also of interest is the finding that the gene for protein tyrosine phosphatase, receptor type D (PTPRD) lies within 1 Mb of the associated region on chromosome 9 (Fig. 3b). PTPRD encodes a receptor type protein tyrosine phosphatase known to be expressed in the brain and with an organising role at a variety of synapses, including those that play a role in synaptic plasticity 47 . As such, it may have a role in a broad range of psychopathology.
Two of the genomic loci associated with mood instability (on chromosomes eight and nine) overlap with loci found to be associated with neuroticism in a recent GWAS and meta-analysis, which combined data from the UK Biobank cohort, the Generation Scotland cohort and a cohort from the Queensland Institute of Medical Research 48 . The neuroticism study made use of scores on the 12-item EPQ-R-S questionnaire, of which one of the questions was the mood instability question used in the present study. This overlap in findings suggests that mood instability is a key component of neuroticism as defined by the EPQ-R-S and that at least some of the gene variants implicated in mood instability are likely to contribute to the broader phenotype of neuroticism. We did not assess for genetic correlation between mood instability and neuroticism using LDSR because both GWAS outputs were predominantly from the same UK Biobank sample.

Strengths and limitations
To the best of our knowledge, this is the first reported GWAS of mood instability. It has enabled objective estimates of heritability and genetic correlation with important psychiatric disorders to be made for the first time. In the future, genotyping data for the full UK Biobank sample (502,000 participants) will be available. This increased sample size may identify larger estimates of shared variance between mood instability and psychiatric disorders.
Some important limitations of this work are acknowledged. The mood instability phenotype used was based on response to a single-item question ('Does your mood often goes up and down?'), which may be an imperfect measure of mood instability. Approximately 44% of the whole UK Biobank cohort answered 'yes' to this question, a much larger proportion than the 13% of participants classified as having mood instability within the UK APMS 2 . This may be because the assessment of mood instability in the APMS was based on a slightly different question ('Do you have a lot of sudden mood changes') and because respondents had to additionally report that they 'suffered this symptom over the last several years'. Clearly, a potential limitation of self-report is the possibility of responder bias and, further, a more complete and objectively assessed measure of mood instability would have been preferable. However, this was not available to us in the UK Biobank phenotype data set and is unlikely to be feasible to collect within a population cohort of this size.

Conclusions
Despite a recognition that mood instability is likely to be an important phenotype underpinning a range of psychiatric disorders-particularly mood disorders 4 -there has to date been very little work on its neural correlates. Early investigations tentatively suggest a role for altered function and/or connectivity of the amygdala 49 , but this is an area that is currently underdeveloped. It is hoped that our findings will stimulate new research on mood instability, which may be a clinically useful and biologically valid trait that cuts across traditional diagnostic categories 50 .