Precision medicine for mood disorders: objective assessment, risk prediction, pharmacogenomics, and repurposed drugs

Le-Niculescu, H.; Roseberry, K.; Gill, S. S.; Levey, D. F.; Phalen, P. L.; Mullen, J.; Williams, A.; Bhairo, S.; Voegtline, T.; Davis, H.; Shekhar, A.; Kurian, S. M.; Niculescu, A. B.

doi:10.1038/s41380-021-01061-w

Immediate Communication
Open access
Published: 08 April 2021

Molecular Psychiatry volume 26, pages 2776–2804 (2021)

Abstract

Mood disorders (depression, bipolar disorders) are prevalent and disabling. They are also highly co-morbid with other psychiatric disorders. Currently there are no objective measures, such as blood tests, used in clinical practice, and available treatments do not work in everybody. The development of blood tests, as well as matching of patients with existing and new treatments, in a precise, personalized and preventive fashion, would make a significant difference at an individual and societal level. Early pilot studies by us to discover blood biomarkers for mood state were promising [1], and validated by others [2]. Recent work by us has identified blood gene expression biomarkers that track suicidality, a tragic behavioral outcome of mood disorders, using powerful longitudinal within-subject designs, validated them in suicide completers, and tested them in independent cohorts for ability to assess state (suicidal ideation), and ability to predict trait (future hospitalizations for suicidality) [3,4,5,6]. These studies showed good reproducibility with subsequent independent genetic studies [7]. More recently, we have conducted such studies also for pain [8], for stress disorders [9], and for memory/Alzheimer’s Disease [10]. We endeavored to use a similar comprehensive approach to identify more definitive biomarkers for mood disorders, that are transdiagnostic, by studying mood in psychiatric disorders patients. First, we used a longitudinal within-subject design and whole-genome gene expression approach to discover biomarkers which track mood state in subjects who had diametric changes in mood state from low to high, from visit to visit, as measured by a simple visual analog scale that we had previously developed (SMS-7). Second, we prioritized these biomarkers using a convergent functional genomics (CFG) approach encompassing in a comprehensive fashion prior published evidence in the field. 
Third, we validated the biomarkers in an independent cohort of subjects with clinically severe depression (as measured by the Hamilton Depression Scale (HAMD)) and with clinically severe mania (as measured by the Young Mania Rating Scale (YMRS)). Adding the scores from the first three steps into an overall convergent functional evidence (CFE) score, we ended up with 26 top candidate blood gene expression biomarkers that had a CFE score as good as or better than SLC6A4, an empirical finding which we used as a de facto positive control and cutoff. Notably, there was among them an enrichment in genes involved in circadian mechanisms. We further analyzed the biological pathways and networks for the top candidate biomarkers, showing that circadian, neurotrophic, and cell differentiation functions are involved, along with serotonergic and glutamatergic signaling, supporting a view of mood as reflecting energy, activity and growth. Fourth, we tested in independent cohorts of psychiatric patients the ability of each of these 26 top candidate biomarkers to assess state (mood (SMS-7), depression (HAMD), mania (YMRS)), and to predict clinical course (future hospitalizations for depression, future hospitalizations for mania). We conducted our analyses across all patients, as well as personalized by gender and diagnosis, showing increased accuracy with the personalized approach, particularly in women. Again, using SLC6A4 as the cutoff, twelve top biomarkers had the strongest overall evidence for tracking and predicting depression after all four steps: NRG1, DOCK10, GLS, PRPS1, TMEM161B, GLO1, FANCF, HNRNPDL, CD47, OLFM1, SMAD7, and SLC6A4. Of them, six had the strongest overall evidence for tracking and predicting both depression and mania, hence bipolar mood disorders. There were also two biomarkers (RPL3 and SLC6A4) with the strongest overall evidence for mania. These panels of biomarkers have practical implications for distinguishing between depression and bipolar disorder.
Next, we evaluated the evidence for our top biomarkers being targets of existing psychiatric drugs, which permits matching patients to medications in a targeted fashion, and the measuring of response to treatment. We also used the biomarker signatures to bioinformatically identify new/repurposed candidate drugs. Top drugs of interest as potential new antidepressants were pindolol, ciprofibrate, pioglitazone and adiphenine, as well as the natural compounds asiaticoside and chlorogenic acid. The last 3 had also been identified by our previous suicidality studies. Finally, we provide an example of how a report to doctors would look for a patient with depression, based on the panel of top biomarkers (12 for depression and bipolar, one for mania), with an objective depression score, risk for future depression, and risk for bipolar switching, as well as personalized lists of targeted prioritized existing psychiatric medications and new potential medications. Overall, our studies provide objective assessments, targeted therapeutics, and monitoring of response to treatment, that enable precision medicine for mood disorders.

Introduction

“How weary, stale, flat, and unprofitable

Seem to me all the uses of this world!”

– W. Shakespeare, Hamlet

“There are good and bad times, but our mood changes more often than our fortune.”

– Thomas Carlyle

Mood disorders affect up to 1 in 4 individuals in their lifetime. Depression in particular is the leading cause of disability for ages 15–44, a prime productive and reproductive age. Due to the lack of objective tests and the perceived presence of stigma, mood disorders are often underdiagnosed or misdiagnosed (depression instead of bipolar disorder). They are also sub-optimally treated, can lead to self-medication with alcohol and drugs, and may culminate in some cases with suicide.

Blood biomarkers are emerging as important tools in disorders where subjective self-report of an individual or clinical impression of a healthcare professional are not always reliable, and for predicting future risk before the disorder (re-)occurs. They also open the door to precise, personalized matching with medications, and objective monitoring of response to treatment. Pioneering early work by our group has identified candidate blood gene expression biomarkers for mood state using a case–case design and a visual analog scale (VAS) (Le-Niculescu et al.) [1]. Those biomarkers were also validated independently as tracking response to cognitive-behavioral therapy by another group [2]. Recent work by our group has identified blood gene expression biomarkers that track suicidality, a tragic outcome of mood disorders, using a new powerful within-subject longitudinal stepwise approach [4, 5, 11]. These studies show good reproducibility and provide a Rosetta Stone for recent multiple genetic studies of suicide (GWAS, family based) [7]. More recently, we have conducted such studies for pain [8], for stress disorders [9], and for memory/Alzheimer’s Disease [10].

We endeavored to use a similar comprehensive approach to identify more definitive biomarkers for mood disorders in general, and depression in particular. Psychiatric patients may have an increased vulnerability to mood disorders, regardless of their primary diagnosis, as well as increased reasons for mood disorders, due to their often-adverse life trajectory. As such, they may be a particularly suitable population in which to try to identify blood biomarkers for mood disorders, that are generalizable and transdiagnostic. First, we used a powerful longitudinal within-subject design (Fig. 1 and Table 1) in individuals with psychiatric disorders to discover blood gene expression changes between self-reported low mood and high mood states, measured by a VAS, called the Simplified Affective State Scale (SASS), previously described by us [4, 5, 11, 12], which has a subscale of seven items related to mood (SMS-7) (Fig. S1). Second, we prioritized this list of candidate biomarkers with a Bayesian-like CFG approach, comprehensively integrating previous human and animal model evidence in the field. Third, we validated our top candidate biomarkers for mood from discovery and prioritization in an independent cohort of psychiatric subjects with clinically severe depression (as measured by HAMD) or with clinically severe mania (as measured by YMRS). We also analyzed the biological pathways and networks they are involved in (Table 2). Fourth, we tested if the top candidate biomarkers from the first three steps are able to predict low mood state, clinical depression state, and future hospitalizations with depression, in another independent cohort of psychiatric subjects. We tested the biomarkers in all subjects in the test cohort, as well as in a more personalized fashion by gender and psychiatric diagnosis (Fig. 2A–D). 
We also conducted similar analyses for predictions of high mood, clinical mania state, and future hospitalizations with mania (Table 3B, C, and Supplementary Information—Pathways, Predictions and Reproducibility). Next, we identified which of our biomarkers are targets of existing drugs and thus can be used for pharmacogenomic population stratification and measuring of response to treatment for depression. We also used the biomarker gene expression signatures to interrogate the Connectivity Map database from Broad/MIT, and the NIH LINCS database, in order to identify drugs and natural compounds that can be repurposed for treating and preventing depression, including bipolar depression. Finally, we provide an example of how a personalized patient report can be generated for clinicians to use, reflecting the objective assessment of depression state, future risk of severe depression, risk of bipolarity, matching with existing psychiatric medications, matching with non-psychiatric/repurposed medications, and monitoring response to treatment.

**Fig. 1: Steps 1–3: Discovery, Prioritization and Validation of Biomarkers for Mood.**

**Fig. 2: Best single biomarkers predictors for depression, state and trait.**

Table 1 Demographics of cohorts used.

Table 2 Biology of mood biomarkers. A Pathway analyses. B Diseases.

Table 3 Convergent functional evidence (CFE): A Top biomarkers for low mood/depression. n = 12 genes, 13 probesets, using as a cutoff the score for SLC6A4; B Top biomarkers for bipolar mood disorders. n = 6 genes, using as a cutoff the score for SLC6A4. These genes are contained in the list of top biomarkers for depression in A. C Top biomarkers for high mood/mania. n = 2 genes, using as a cutoff the score for SLC6A4. RPL3 does not overlap with the list of top biomarkers for depression in A.

Materials and methods

Cohorts

We used three independent cohorts: (1) discovery (a longitudinal psychiatric subjects cohort with diametric changes in mood state from at least two consecutive testing visits); (2) validation (an independent psychiatric subjects cohort with clinically severe depression or mania); and (3) testing (an independent psychiatric subjects test cohort for predicting mood state, clinical depression or mania, and for predicting future hospitalizations for depression or mania) (Fig. 1A and Table 1).

Similar to our previous studies in suicide [3,4,5], the live psychiatric subjects are part of a larger longitudinal cohort of adults that we are continuously collecting. Subjects are recruited primarily from the patient population at the Indianapolis VA Medical Center. All subjects understood and signed informed consent forms detailing the research goals, procedure, caveats and safeguards, per IRB approved protocol. Subjects completed diagnostic assessments by structured clinical interviews. They had an initial testing visit in the lab or on the inpatient psychiatric unit, followed by up to six testing visits, 3–6 months apart or whenever a new psychiatric hospitalization occurred. At each testing visit, they received a series of psychiatric rating scales, and their blood was drawn. The rating scales included the Hamilton Rating Scale for Depression-17 (HAMD), the Young Mania Rating Scale (YMRS), and a visual analog scale for assessing mood state (SMS-7), which provides a score that is the average of seven items (Fig. S1A), and is part of the SASS (Niculescu et al. [12], Niculescu et al. [4], Levey et al. [5], Niculescu et al. [6]). SMS-7 integrates on a continuum in a quantitative fashion clinical symptoms for depression and mania, and provides a score for mood state at a particular moment in time. This is a state measure, related to how people feel in the present. It has good face validity based on DSM criteria, and correlates inversely with HAMD [12] (Fig. S1B). SASS, in addition to seven items measuring mood (SMS-7), also has four items measuring anxiety (SAS-4). We also used the PANSS Positive scale, that measures positive psychotic symptoms. These last two measures (SAS-4 and PANSS Positive) may define subtypes of low mood, as shown in the Discovery cohort (Fig. S1E).

We also created and used a checklist/measure of clinical severity of bipolar disorder, based on past history, called Convergent Functional Information for Bipolar Disorder Severity (CFI-BP) scale, ranking patients with mood disorders on a scale of 1–10. This is a trait measure, related to how people behaved in their past (Fig. S2).

At each visit, we collected whole blood (5 ml) in two RNA-stabilizing PAXgene tubes, labeled with an anonymized study ID number, and stored at −80 °C in a locked freezer until the time of future processing. Whole-blood RNA was extracted for microarray gene expression studies from the PAXgene tubes, as detailed below.

For this study, our within-subject discovery cohort, from which the biomarker data were derived, consisted of 44 subjects (30 males, 14 females) with psychiatric disorders and multiple testing visits, who each had at least one diametric change in SMS-7 mood scores from low mood (SMS-7 ≤ 40) to high mood (SMS-7 ≥ 60), or vice versa, from one testing visit to another. There were 4 subjects with 6 visits each, 6 subjects with 4 visits each, 18 subjects with 3 visits each, and 16 subjects with 2 visits each, resulting in a total of 134 blood samples for subsequent gene expression microarray studies (Fig. 1, Tables 1 and S1).

Our independent validation cohort, in which the top biomarker findings were validated for being even more changed in expression, consisted of 39 male and 8 female subjects with a clinically severe mood disorder (n = 30 depression as measured by HAMD scores ≥22, and n = 17 mania as measured by YMRS scores ≥20), and concordant low mood, respectively high mood, SMS-7 scores (Tables 1 and S1).

Our independent test cohort for predicting low-mood state (SMS-7 ≤ 40) and high-mood state (SMS-7 ≥ 60) consisted of 153 male and 37 female subjects with psychiatric disorders, demographically matched with the discovery cohort, with one or multiple testing visits in our study, with either low mood, intermediate mood, or high mood states (Fig. 1 and Table 1).

Our independent test cohort for predicting clinical depression state (HAMD ≥ 22) consisted of 181 male and 45 female subjects with psychiatric disorders, demographically matched for age, with one or multiple testing visits in our study, with either low, intermediate, or high HAMD scores. Our independent test cohort for predicting clinical mania state (YMRS ≥ 20) consisted of 73 male and 24 female subjects with psychiatric disorders, demographically matched for age, with one or multiple testing visits in our study, with either low, intermediate, or high YMRS scores (Fig. 1 and Table 1).

Our test cohorts for predicting future hospitalizations with depression, and future hospitalizations with mania (Fig. 1 and Table 1), are a subset of the independent test cohort for which we had longitudinal follow-up with electronic medical records. The subjects’ subsequent number of hospitalizations with depression, and with mania, was tabulated from electronic medical records.

Medications

The subjects in the discovery cohort were all diagnosed with various psychiatric disorders (Table 1), and had various medical co-morbidities. Their medications were listed in their electronic medical records, and documented by us at the time of each testing visit. Medications can have a strong influence on gene expression. However, there was no consistent pattern of any particular type of medication, as our subjects were on a wide variety of different medications, psychiatric and non-psychiatric. Furthermore, the gene expression data of the independent validation and testing cohorts were Z-scored by gender and by diagnosis before being combined, to normalize for any such effects. Some subjects may be non-compliant with their treatment and may thus have changes in medications or drugs of abuse not reflected in their medical records. That being said, our goal is to find biomarkers that track mood, regardless of whether the underlying reason is endogenous biology or an effect of medications or drugs. In fact, one would expect some of these biomarkers to be targets of medications, as we show in this paper. Moreover, the prioritization step that occurs after discovery is based on a field-wide convergence with literature that includes genetic data and animal model data, which are unrelated to medication effects. Overall, the discovery, validation, and replication by testing in independent cohorts of the biomarkers, with our design, occurs despite the subjects having different genders and diagnoses, being on various different medications, and differing in other lifestyle variables.

Blood gene expression experiments

RNA extraction

Whole blood (2.5 ml) was collected into each PAXgene tube by routine venipuncture. PAXgene tubes contain proprietary reagents for the stabilization of RNA. RNA was extracted and processed as previously described [3,4,5].

Microarrays

Microarray work was carried out using previously described methodology [3,4,5,6].

Of note, all genomic data were normalized (RMA for technical variability, then z-scoring for biological variability), by gender and psychiatric diagnosis, before being combined and analyzed.
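The second normalization step (z-scoring by gender and psychiatric diagnosis) can be illustrated with a minimal sketch. This is not the authors' code: the record keys (`gender`, `diagnosis`, `expr`) are hypothetical, and the use of the population standard deviation is an assumption, since the text does not state which estimator was used.

```python
from collections import defaultdict
from statistics import mean, pstdev


def zscore_by_group(samples):
    """Z-score one probeset's expression within each (gender, diagnosis) group.

    `samples` is a list of dicts with hypothetical keys 'gender',
    'diagnosis' and 'expr'. Returns z-scores in input order.
    """
    # Collect the sample indices belonging to each gender/diagnosis group.
    groups = defaultdict(list)
    for i, s in enumerate(samples):
        groups[(s["gender"], s["diagnosis"])].append(i)

    z = [0.0] * len(samples)
    for idx in groups.values():
        values = [samples[i]["expr"] for i in idx]
        mu = mean(values)
        sd = pstdev(values)  # assumption: population SD
        for i in idx:
            # A group with no spread is mapped to 0 to avoid dividing by zero.
            z[i] = (samples[i]["expr"] - mu) / sd if sd > 0 else 0.0
    return z
```

Because each value is expressed relative to its own group, samples from different genders and diagnoses end up on a common scale before being combined.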

Biomarkers

Step 1: Discovery

We used each subject’s score from a visual-analog scale (SMS-7), assessed at the time of blood collection (Fig. 1). We analyzed gene expression differences between visits with low mood (defined as a score of 0–40) and visits with high mood (defined as a score of 60–100), using a powerful within-subject design, then an across-subjects summation (Fig. 1).

We analyzed the data in two ways: an absent–present (AP) approach, and a differential expression (DE) approach, as in previous work by us on suicide biomarkers [3,4,5]. The AP approach may capture turning on and off of genes, and the DE approach may capture gradual changes in expression. Analyses were performed as previously described [4,5,6]. In brief, we imported all Affymetrix microarray data as .CEL files into the Partek Genomics Suite 6.6 software package (Partek Incorporated, St. Louis, MO, USA). Using only the perfect match values, we ran a robust multi-array analysis (RMA) by gender and diagnosis, background corrected with quantile normalization and a median polish probeset summarization of all chips, to obtain the normalized expression levels of all probesets for each chip. Then, to establish a list of differentially expressed probesets, we conducted a within-subject analysis, using a fold change in expression of at least 1.2 between consecutive high- and low-mood visits within each subject. Probesets that have at least a 1.2-fold change are assigned either a 1 (increased in high mood) or a −1 (decreased in high mood) in each comparison. Fold changes between 1.1 and 1.2 are given 0.5, and fold changes less than 1.1 are given 0. These values were then summed for each probeset across all the comparisons and subjects, yielding a range of raw scores. The probesets above 33.3% of raw scores were carried forward in analyses (Fig. 1) and received an internal score of 2 points; those above 50% received 4 points, and those above 80% received 6 points [4,5,6]. We have developed R scripts in our lab to automate and conduct all these large dataset analyses in bulk, checked against human manual scoring [6].
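The DE scoring rule can be sketched as follows. This is an illustration in Python rather than the authors' R scripts, and it rests on three assumptions that the text does not spell out: inputs are non-log expression levels, the 0.5 score carries the sign of the change, and the percentage cutoffs are fractions of the largest absolute raw score. All function names are hypothetical.

```python
def comparison_score(high, low):
    """Score one high-mood vs. low-mood comparison for one probeset.

    Returns +1/-1 for a fold change of at least 1.2, +0.5/-0.5 for a fold
    change between 1.1 and 1.2 (signed: assumption), and 0 otherwise.
    `high` and `low` are positive, non-log expression levels.
    """
    ratio = high / low
    fold = max(ratio, 1 / ratio)   # size of the change, ignoring direction
    sign = 1 if ratio >= 1 else -1  # +: increased in high mood
    if fold >= 1.2:
        return sign * 1.0
    if fold >= 1.1:
        return sign * 0.5
    return 0.0


def raw_score(comparisons):
    """Sum the comparison scores over all (high, low) pairs of a probeset."""
    return sum(comparison_score(h, l) for h, l in comparisons)


def internal_points(raw, max_raw):
    """Map a raw score to internal points (6, 4, 2, or 0 if not carried on).

    Assumption: cutoffs are fractions of `max_raw`, the largest absolute
    raw score observed across probesets.
    """
    frac = abs(raw) / max_raw
    if frac >= 0.8:
        return 6
    if frac >= 0.5:
        return 4
    if frac >= 1 / 3:
        return 2
    return 0
```

For example, a probeset with comparisons `(1.3, 1.0)`, `(1.15, 1.0)` and `(1.0, 1.3)` scores 1 + 0.5 − 1, so opposite changes in different comparisons cancel in the summation.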

Gene symbols for the probesets were identified using NetAffyx (Affymetrix) for Affymetrix HG-U133 Plus 2.0 GeneChips, followed by GeneCards to confirm the primary gene symbol. In addition, for those probesets that were not assigned a gene symbol by NetAffyx, we used GeneAnnot (https://genecards.weizmann.ac.il/geneannot/index.shtml), or if need be UCSC (https://genome.ucsc.edu), to obtain gene symbols for these uncharacterized probesets, followed by GeneCards. Genes were then scored using our manually curated convergent functional genomics (CFG) databases as described below (Fig. 1D).

Step 2: Prioritization using CFG

Databases

We have established in our laboratory (Laboratory of Neurophenomics, www.neurophenomics.info) manually curated databases of the human gene expression/protein expression studies (postmortem brain, peripheral tissue/fluids: CSF, blood and cell cultures), human genetic studies (association, copy number variations and linkage), and animal model gene expression and genetic studies, published to date on psychiatric disorders. Only findings deemed significant in the primary publication, by the study authors, using their particular experimental design and thresholds, are included in our databases. Our databases include only primary literature data and do not include review papers or other secondary data integration analyses to avoid redundancy and circularity. We also favored unbiased discovery studies over candidate-gene hypothesis-driven studies. These large and constantly updated databases have been used in our CFG cross validation and prioritization platform (Fig. 1D). For this study, data from 1600 papers on mood disorders were present in the databases at the time of the CFG for mood disorders analyses (June 2018) (human genetic studies: 759; human brain studies: 246; human peripheral tissue/fluids: 359; non-human genetic studies: 47; non-human brain studies: 167; non-human peripheral tissue/fluids: 22). We have developed in our lab a computerized CFG Wizard to automate and score in bulk large lists of genes by integrating evidence from these large databases, checked against manual scoring [6]. Analyses were performed as previously described [4, 5].

Step 3: Validation analyses

We examined which of the top candidate genes (score of 6 or above after the first two steps) were stepwise changed in expression from the clinically depressed validation group to the low-mood discovery group to the high-mood discovery group to the clinically manic validation group. A total score of 6 or above after the first two steps permits the inclusion of potentially novel genes with maximal internal score of 6 from discovery but no external evidence CFG score from prioritization. Subjects with low mood as well as subjects with high mood from the discovery cohort who did not have clinically severe depression or mania were used, along with the independent validation cohort (n = 47).

The AP-derived and DE-derived lists of genes were combined, and the corresponding gene expression data were used for the validation analysis. The four groups (clinical depression, low mood, high mood, clinical mania) were assembled out of Affymetrix .CEL data that were RMA normalized by gender and diagnosis. We transferred the log-transformed expression data to an Excel sheet, and reversed the log transformation by taking 2 to the power of the transformed expression value. We then Z-scored the values by gender and diagnosis. We imported the Excel sheets with the Z-scored expression data into Partek, performed statistical analyses using a one-way ANOVA for the stepwise changed probesets, and applied a stringent Bonferroni correction for all the probesets tested in ANOVA (Fig. 1E).
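The numerical steps of the validation analysis can be sketched in a few lines. This is a simplified illustration, not the Partek workflow used in the study: it reverses the log2 transform, checks for a stepwise change across the four ordered groups (read here as monotonic group means, which is an assumption), computes the one-way ANOVA F statistic, and derives a Bonferroni-adjusted significance threshold. Converting F to a p-value requires an F distribution from a statistics library and is left out. All function names are hypothetical.

```python
from statistics import mean


def unlog2(values):
    """Reverse a log2 transform: 2 raised to each transformed value."""
    return [2.0 ** v for v in values]


def is_stepwise(groups):
    """True if group means rise or fall monotonically across ordered groups.

    Groups are ordered: clinical depression, low mood, high mood, mania.
    Monotonic means as the definition of 'stepwise' is an assumption.
    """
    means = [mean(g) for g in groups]
    pairs = list(zip(means, means[1:]))
    return all(a < b for a, b in pairs) or all(a > b for a, b in pairs)


def anova_f(groups):
    """One-way ANOVA F statistic for a list of groups (lists of numbers)."""
    k = len(groups)
    n = sum(len(g) for g in groups)
    grand = mean([x for g in groups for x in g])
    ss_between = sum(len(g) * (mean(g) - grand) ** 2 for g in groups)
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)
    return (ss_between / (k - 1)) / (ss_within / (n - k))


def bonferroni_alpha(alpha, n_tests):
    """Per-test significance threshold after Bonferroni correction."""
    return alpha / n_tests
```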

Top candidate biomarkers (after the first 3 steps)

Adding the scores from the first three steps into an overall convergent functional evidence (CFE) score (Fig. 1), we ended up with a list of 26 top candidate biomarkers (26 probesets in n = 23 genes), that had evidence, i.e., a CFE score, as good as or better than SLC6A4 (the serotonin transporter) (see also Supplementary Information- Pathways, Predictions and Reproducibility). SLC6A4 is arguably the most well studied molecular underpinning of mood disorders in biological psychiatry, and the target of the majority of antidepressant medications. We discovered it empirically as a blood biomarker as part of our work, and used it as a de facto positive control and cutoff. These 26 top candidate biomarkers were carried forward into additional analyses for biological understanding and for clinical utility.

Biological understanding

Clock gene database

We compiled a database of genes associated with circadian function, by using a combination of review papers [13, 14] and searches of existing databases CircaDB (http://circadb.hogeneschlab.org), GeneCards (http://www.genecards.org), and GenAtlas (http://genatlas.medecine.univ-paris5.fr). Using the data we compiled from these sources we identified a total of 1468 genes that show circadian functioning. We further classified genes into “core” clock genes, i.e., those genes that are the main engine driving circadian function (n = 18), “immediate” clock genes, i.e., the genes that directly input or output to the core clock (n = 331), and “distant” clock genes, i.e., genes that directly input or output to the immediate clock genes (n = 1119).

Pathway analyses

IPA (Ingenuity Pathway Analysis, version 24390178, Qiagen), DAVID Functional Annotation Bioinformatics Microarray Analysis (National Institute of Allergy and Infectious Diseases) version 6.7 (August 2016), and Kyoto Encyclopedia of Genes and Genomes (KEGG) (through DAVID) were used to analyze the biological roles, including top canonical pathways and diseases (Table 2). We performed the pathway analyses for the 26 biomarkers (23 unique genes) that were the top candidate biomarkers after discovery, prioritization, and validation.

Networks

For network analyses, we used the STRING interaction network tool (https://string-db.org), inputting the genes into the search window and running the Multiple Proteins analysis for Homo sapiens (Fig. S3).

CFG beyond mood: evidence for involvement in other psychiatric and related disorders

We also used a CFG approach to examine evidence from other psychiatric and related disorders, as exemplified for the list of top biomarkers after Step 4 testing (Table S3). This was not used to prioritize genes, but rather to understand the molecular basis of clinical co-morbidities.

Testing for clinical utility in independent cohorts

We tested in independent cohorts of psychiatric patients the ability of each of the top candidate biomarkers (n = 26) to assess state severity (mood (measured by SMS-7), depression (measured by HAMD), mania (measured by YMRS)), and predict trait risk (future hospitalizations with depression, future hospitalizations with mania). We conducted our analyses across all patients, as well as personalized by gender and diagnosis. We then used the biomarkers from the list to predict, in independent cohorts, state (low mood, SMS-7 ≤ 40; depression, HAMD ≥ 22) and trait (future hospitalizations with depression) in the first year of follow-up, and in all future years of follow-up. We also conducted similar analyses for predicting high mood, mania, and future hospitalizations for mania.

The test cohort for predicting low mood/depression (state), and the test cohort for predicting future hospitalizations with depression (trait), were assembled out of data that were RMA normalized by gender and diagnosis. The test cohort was completely independent from the discovery and validation cohorts; there was no subject overlap with them. Individual markers used for predictions were Z-scored by gender and diagnosis, to be able to combine different biomarkers into panels and to avoid potential artefacts due to different ranges of expression in different genders and diagnoses. For panels, biomarkers were combined by simple summation of the increased risk biomarkers minus the decreased risk biomarkers. Predictions were performed using RStudio. For cross-sectional analyses, we used biomarker expression levels, Z-scored by gender and diagnosis. For longitudinal analyses, we combined four measures: biomarker expression levels, slope (defined as ratio of levels at current testing visit vs. previous visit, divided by time between visits), maximum levels (at any of the current or past visits), and maximum slope (between any adjacent current or past visits). For decreased biomarkers, we used the minimum rather than the maximum for level calculations. All four measures were Z-scored, then combined in an additive fashion into a single measure. The longitudinal analysis was carried out in a sub-cohort of the testing cohort consisting of subjects that had at least two visits (timepoints).
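The panel summation and the four longitudinal measures can be sketched as below. This is an illustrative Python version under stated assumptions, not the authors' R code: the time unit is arbitrary, the minimum is applied only to levels for decreased biomarkers (as the text states), and the final Z-scoring across the cohort is assumed to have been done before `combine_measures` is called. All names are hypothetical.

```python
def panel_score(z_levels, increased, decreased):
    """Sum of Z-scored increased-risk markers minus decreased-risk markers.

    `z_levels` maps a gene symbol to its Z-scored expression level.
    """
    return (sum(z_levels[g] for g in increased)
            - sum(z_levels[g] for g in decreased))


def longitudinal_measures(visits, decreased=False):
    """Four measures for one biomarker at a subject's latest visit.

    `visits` is a time-ordered list of (time, level) pairs with at least
    two entries and positive levels. Slope is the ratio of the current to
    the previous level, divided by the time between the two visits.
    """
    slopes = [
        (lvl / prev_lvl) / (t - prev_t)
        for (prev_t, prev_lvl), (t, lvl) in zip(visits, visits[1:])
    ]
    levels = [lvl for _, lvl in visits]
    # For decreased biomarkers the minimum level replaces the maximum.
    extreme_level = min(levels) if decreased else max(levels)
    return {
        "level": levels[-1],
        "slope": slopes[-1],
        "max_level": extreme_level,
        "max_slope": max(slopes),
    }


def combine_measures(z_measures):
    """Additive combination of the four (already Z-scored) measures."""
    return sum(z_measures.values())
```

With visits `[(0, 2.0), (2, 4.0), (4, 2.0)]`, the two slopes are (4/2)/2 = 1.0 and (2/4)/2 = 0.25, so the current slope is 0.25 and the maximum slope is 1.0.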

Predicting state: low mood, clinical depression

Receiver-operating characteristic (ROC) analyses between marker levels and mood state were performed by assigning subject visits with an SMS-7 mood score of ≤40 to the low mood category, and subjects with HAMD scores ≥22 to the clinically depressed category. We used the pROC package for R (Robin et al., BMC Bioinformatics 2011) (Table 3 and Fig. 2). In addition, a one-tailed t-test was performed between the low mood group and the rest, and a Pearson R (one-tail) was calculated between mood scores and biomarker levels.

Similar analyses were conducted for high mood state (SMS-7 score of ≥60) and clinical mania state (YMRS ≥ 20).
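The quantity summarized by a ROC analysis, the area under the curve (AUC), can be illustrated with a short sketch. The study used the pROC package in R; the Python code below is only a stand-in that computes the AUC through its rank interpretation (the probability that a visit in the target category has a higher marker value than a visit outside it, with ties counted as one half). Function names are hypothetical, and a biomarker that is decreased in the target state would need its sign flipped first.

```python
def low_mood_labels(sms7_scores, cutoff=40):
    """Label each visit True if its SMS-7 score falls in the low mood range."""
    return [s <= cutoff for s in sms7_scores]


def roc_auc(marker_levels, labels):
    """AUC via pairwise comparison of positive and negative visits."""
    pos = [m for m, y in zip(marker_levels, labels) if y]
    neg = [m for m, y in zip(marker_levels, labels) if not y]
    if not pos or not neg:
        raise ValueError("both categories need at least one visit")
    # Each positive/negative pair contributes 1 if the positive visit has
    # the higher marker level, 0.5 for a tie, and 0 otherwise.
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))
```

An AUC of 0.5 corresponds to chance-level separation and 1.0 to perfect separation of the two categories.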

Predicting trait: future psychiatric hospitalization with depression as a symptom/reason for admission

We conducted analyses for predicting future psychiatric hospitalizations with depression as a symptom/reason for admission in the first year following each testing visit, in subjects that had at least 1 year of follow-up in the VA system, in which we have access to complete electronic medical records. ROC analyses between biomarker measures (cross-sectional, longitudinal) at a specific testing visit and future hospitalizations were performed as described above, based on whether or not subjects had been admitted to the hospital with depression. In addition, a one-tailed t-test with unequal variance was performed between groups of subject visits with and without future hospitalization with depression. Pearson R (one-tail) correlation was performed between hospitalization frequency (number of hospitalizations with depression divided by duration of follow-up) and marker levels. A Cox regression was performed using the time in days from the testing visit date to first hospitalization date in the case of patients who had been hospitalized, or 365 days for those who had not. The odds ratio (OR) was calculated such that a value greater than 1 always indicates increased risk for hospitalization, regardless of whether the biomarker is increased or decreased in expression.

We also conducted Cox regression and Pearson R analyses for all future hospitalizations with depression, including those occurring beyond 1 year of follow-up, in the years following testing (on average 5.12 years per subject, range 0.07–11.27 years), as these calculations, unlike the ROC and t-test, account for the actual length of follow-up, which varied from subject to subject. If used over this longer period, the ROC and t-test might in fact under-represent the power of the markers to predict, as the more severe psychiatric patients are more likely to move geographically and/or be lost to follow-up. The Cox regression was performed using the time in days from visit date to first hospitalization date in the case of patients who had hospitalizations with depression, or from visit date to last note date in the electronic medical records for those who did not.
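The follow-up variables described above can be sketched as follows. This is a minimal Python illustration of how the time-to-event inputs for the Cox regression and the hospitalization frequency for the Pearson correlation might be derived from dates; it does not fit the regression itself. The function names are hypothetical, and two details are assumptions not stated in the text: hospitalizations are counted only if they fall strictly after the visit date, and follow-up duration is expressed in years.

```python
from datetime import date
from typing import List, Tuple


def first_year_outcome(visit: date, hospitalizations: List[date]) -> Tuple[int, int]:
    """(time in days, event flag) for the first-year analysis.

    Time runs to the first hospitalization within 365 days (event = 1),
    otherwise it is set to 365 days (event = 0).
    """
    later = sorted(d for d in hospitalizations if d > visit)  # assumption: strictly after visit
    if later and (later[0] - visit).days <= 365:
        return (later[0] - visit).days, 1
    return 365, 0


def all_future_outcome(visit: date, hospitalizations: List[date],
                       last_note: date) -> Tuple[int, int]:
    """(time in days, event flag) using the full follow-up.

    Subjects without a hospitalization are followed to the last note date.
    """
    later = sorted(d for d in hospitalizations if d > visit)
    if later:
        return (later[0] - visit).days, 1
    return (last_note - visit).days, 0


def hospitalization_frequency(visit: date, hospitalizations: List[date],
                              last_note: date) -> float:
    """Number of hospitalizations divided by follow-up duration (assumed in years)."""
    years = (last_note - visit).days / 365.25
    if years <= 0:
        raise ValueError("follow-up duration must be positive")
    return sum(1 for d in hospitalizations if d > visit) / years


# Hypothetical subject visit with two later hospitalizations.
visit = date(2015, 1, 1)
hosp = [date(2015, 3, 2), date(2017, 6, 1)]
last_note = date(2019, 1, 1)
print(first_year_outcome(visit, hosp))                    # first-year time and event
print(all_future_outcome(visit, [], last_note))           # no hospitalization: follow to last note
print(hospitalization_frequency(visit, hosp, last_note))  # hospitalizations per year
```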

Similar analyses were conducted for future hospitalizations with mania as a symptom/reason for hospitalization.

Therapeutics

Pharmacogenomics

We analyzed which of the top biomarkers for depression and for mania after Steps 1–4 are known to be changed in expression by existing drugs in a direction opposite to the one in disease, using our CFG databases, and using Ingenuity Drugs analyses (Table 3 and Table S4).

New drug discovery/repurposing

We also analyzed which drugs and natural compounds are an opposite match for the gene expression signatures of our top biomarkers, using the Connectivity Map (https://portals.broadinstitute.org, Broad Institute, MIT) (Fig. 3 and Table 4). Of note, not all the probesets from the HG-U133 Plus 2.0 array we used were present in the HG-U133A array used for the Connectivity Map. We stayed with exact probeset-level matches, not gene-level imputation. We also used the NIH LINCS database to conduct similar analyses, at a gene level.

**Fig. 3: Therapeutics: matching with medications.**

Table 4 Therapeutics: drug repurposing for depression.


Report generation

We present an example of how a report to doctors might look, using the above insights. We used a panel of top biomarkers after Steps 1–4 (Fig. 3 and Table 3), BioM12 + 1: n = 12 genes to generate a score for depression severity, plus the mania biomarker RPL3 to inform risk for bipolar switching. Out of a dataset of 794 subject visits, we chose as a case study a visit from a female patient with depression who had died by suicide, a case previously discussed in a suicide biomarker paper of ours (Levey et al. [5]) (Fig. 4).

**Fig. 4: Example of report for physicians.**

First, we removed that patient from the dataset, and divided the remaining dataset into three populations: those who had a high HAMD score ≥22 (concordant with a low SMS-7 mood score ≤40), those who had a low HAMD score ≤7, and those who had an intermediate HAMD score. For the first two groups, we calculated the average Z-scored expression values for each biomarker in the panel. We then compared the levels of each biomarker, in each subject in the cohort, including the subject of interest, to these reference levels. If a biomarker was higher than the average of the high HAMD subjects it got a 1, if it was below the average of the low HAMD subjects it got a 0, and if it was in between it got a 0.5. Next, we averaged the biomarkers in the panel and multiplied by 100, to generate a score between 0 and 100 for the BioM12 for each of the 794 subject visits, including the case study subject. This digitization of the scores was done to avoid overfitting to our particular cohort, and to provide an easily understandable and interpretable readout for clinicians. The score of the BioM12 is compared to the average score of BioM12 for the high HAMD subjects and the low HAMD subjects, generating three risk categories: high (red), intermediate (yellow), and low (green) for current depression severity. The rank percentile of the score of the patient compared to the distribution of scores of subjects in the database is also provided in the report (Fig. 4).
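The scoring procedure in this paragraph can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the code used to generate the report: the marker names (A to D) and all values are hypothetical, the Z-scored values are assumed to be oriented so that higher means more depression-like for every marker, and the handling of scores that fall exactly on a category boundary is a guess.

```python
from statistics import mean
from typing import Dict, List, Tuple


def reference_levels(group: List[Dict[str, float]]) -> Dict[str, float]:
    """Average Z-scored expression of each marker across the visits in a group."""
    return {m: mean(visit[m] for visit in group) for m in group[0]}


def digitize(value: float, high_ref: float, low_ref: float) -> float:
    """1 above the high HAMD average, 0 below the low HAMD average, else 0.5."""
    if value > high_ref:
        return 1.0
    if value < low_ref:
        return 0.0
    return 0.5


def panel_score(visit: Dict[str, float], high_refs: Dict[str, float],
                low_refs: Dict[str, float]) -> Tuple[float, Dict[str, float]]:
    """Return the 0 to 100 panel score and the per-marker digitized values."""
    digits = {m: digitize(v, high_refs[m], low_refs[m]) for m, v in visit.items()}
    return 100 * mean(digits.values()), digits


def risk_category(score: float, high_avg: float, low_avg: float) -> str:
    """Assumed boundary handling: at or above the high-group average is 'high'."""
    if score >= high_avg:
        return "high"
    if score <= low_avg:
        return "low"
    return "intermediate"


# Hypothetical four-marker panel and reference groups (two visits each).
high_group = [{"A": 1.0, "B": 0.5, "C": 1.5, "D": 1.0},
              {"A": 0.5, "B": 1.0, "C": 0.5, "D": 0.5}]
low_group = [{"A": -1.0, "B": -0.5, "C": -1.0, "D": -0.5},
             {"A": -0.5, "B": -1.0, "C": 0.0, "D": -0.5}]
patient = {"A": 1.25, "B": 0.0, "C": -1.0, "D": 2.0}

score, digits = panel_score(patient, reference_levels(high_group),
                            reference_levels(low_group))
print(score, digits)
```

Because each marker contributes only 0, 0.5, or 1, the panel score depends on where a value falls relative to the two reference averages rather than on its exact magnitude, which is the property the text relies on to limit overfitting.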

Second, future risk is assessed by looking at how many of the three biomarkers in the panel that are good predictors of future hospitalizations for depression (NRG1, PRPS1, SMAD7) were a 1, generating 0 to 3 asterisks.

Third, we examined how many of the bipolar biomarkers (n = 6) in BioM12 had a score of 1. If more than 50% of them (more than 3 out of 6) were a 1, then the patient gets an asterisk for bipolar risk. If the mania biomarker RPL3 is also 1, then the patient gets another asterisk for risk of bipolarity, i.e., risk of switch if treated for depression. In those with asterisks for risk of bipolarity, it is advisable to choose mood stabilizers or antipsychotics from the medication choices provided by the report.
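The two asterisk rules (future hospitalization risk and bipolar risk) can be sketched as simple counts over the digitized marker values. In this minimal Python illustration the function names and the placeholder bipolar marker names (bp1 to bp6) are hypothetical, since the six bipolar biomarkers are not listed in this section. It also assumes that the asterisk from the separate mania biomarker is awarded independently of the panel-based asterisk, which is one possible reading of the text.

```python
from typing import Dict, Sequence


def future_risk_asterisks(digits: Dict[str, float],
                          predictors: Sequence[str] = ("NRG1", "PRPS1", "SMAD7")) -> int:
    """Count how many of the hospitalization predictors were scored 1 (0 to 3)."""
    return sum(1 for gene in predictors if digits.get(gene) == 1.0)


def bipolar_risk_asterisks(digits: Dict[str, float],
                           bipolar_markers: Sequence[str],
                           mania_marker_digit: float) -> int:
    """One asterisk if more than half of the bipolar markers are 1,
    plus one if the separate mania biomarker is 1 (assumed independent)."""
    n_high = sum(1 for gene in bipolar_markers if digits.get(gene) == 1.0)
    stars = 1 if n_high > len(bipolar_markers) / 2 else 0
    if mania_marker_digit == 1.0:
        stars += 1
    return stars


# Hypothetical digitized values for one visit.
bipolar_panel = ["bp1", "bp2", "bp3", "bp4", "bp5", "bp6"]  # placeholder names
visit_digits = {"NRG1": 1.0, "PRPS1": 0.5, "SMAD7": 1.0,
                "bp1": 1.0, "bp2": 1.0, "bp3": 1.0, "bp4": 1.0,
                "bp5": 0.5, "bp6": 0.0}

future_stars = future_risk_asterisks(visit_digits)
bipolar_stars = bipolar_risk_asterisks(visit_digits, bipolar_panel, mania_marker_digit=1.0)
print("*" * future_stars, "*" * bipolar_stars)
```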

Fourth, for each biomarker in the panel, we also have a list of existing psychiatric medications that modulate the expression of the biomarker in the direction of high mood. Each medication got a score of 1 or 0 depending on whether or not it modulated a particular biomarker in the panel, and that score is multiplied by the risk score of the biomarker, i.e., 1 or 0.5 or 0. A medication can modulate more than one biomarker. We then calculated an average score for each medication based on its effects on all the biomarkers in the panel, and multiplied that by 100, resulting in a score of 0 to 100 for each medication. Thus, psychiatric medications are matched to the patient and ranked in order of impact on the panel.
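A minimal Python sketch of this medication ranking follows. The marker names, drug names, and modulation lists are hypothetical placeholders, not entries from the study's databases, and the average is assumed to be taken over all biomarkers in the panel (markers a drug does not modulate contribute 0).

```python
from typing import Dict, Set


def medication_scores(digits: Dict[str, float],
                      modulates: Dict[str, Set[str]]) -> Dict[str, float]:
    """Score each medication from 0 to 100 and return them ranked, highest first.

    For each panel marker, the contribution is 1 (drug modulates it) or 0,
    multiplied by the marker's digitized risk score (1, 0.5, or 0).
    """
    n_markers = len(digits)
    scores = {}
    for drug, targets in modulates.items():
        total = sum(risk for marker, risk in digits.items() if marker in targets)
        scores[drug] = 100 * total / n_markers
    return dict(sorted(scores.items(), key=lambda item: item[1], reverse=True))


# Hypothetical digitized panel and drug-to-marker modulation lists.
panel_digits = {"A": 1.0, "B": 0.5, "C": 0.0, "D": 1.0}
drug_targets = {
    "drug_x": {"A", "B"},
    "drug_y": {"C"},
    "drug_z": {"A", "D"},
}

ranked = medication_scores(panel_digits, drug_targets)
for drug, drug_score in ranked.items():
    print(drug, drug_score)
```

Under this scheme a medication ranks highly only when it acts on markers that are actually flagged in the individual patient, so the same drug can receive different scores for different patients.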

Fifth, we used the biomarkers that were positive as high risk in the panel, to interrogate the CMAP and do individualized drug repurposing, identifying new non-psychiatric compounds that could be used in that particular patient to treat depression (Fig. 4).

Results

In Step 1 Discovery, we identified candidate blood gene expression biomarkers that: (1) change in expression in blood between self-reported low-mood and high-mood states, (2) track the mood state across visits in a subject, and (3) track the mood state in multiple subjects. We used a visual analog measure for mood state (SMS-7). At a phenotypic level, the SMS-7 quantitates mood state at a particular moment in time, and normalizes mood measurements in each subject, comparing them to the lowest and highest mood that subject ever experienced. We then used a powerful within-subject and then across-subject design in a longitudinally followed cohort of subjects (n = 44 subjects, with 134 visits) who displayed at least a 50% change in the mood measure (from below 40/100 to above 60/100) between at least two consecutive testing visits, to identify differentially expressed genes that track mood state. Using our 33% of maximum raw score threshold (internal score of 2 pt) [4, 5], we had 11,620 unique probesets (corresponding to 9649 unique genes) from Affymetrix Absent/Present (AP) analyses and DE analyses (Fig. 1D). These were carried forward to the prioritization step. This represents approximately a fivefold enrichment of the 54,625 probesets on the Affymetrix array.

We also examined in the discovery cohort whether subtypes of low mood can be identified based on mental state at the time of low mood visits, using two-way hierarchical clustering with anxiety and psychosis measures. The mood state self-report may be more reliable in this cohort, as the subjects demonstrated the aptitude and willingness to report different, and diametric, mood states. We uncovered four potential subtypes of low mood/depression: high anxiety and low psychosis (anxious), high anxiety and high psychosis (combined), low anxiety and high psychosis (psychotic), and low anxiety and low psychosis (pure low mood) (Fig. S3). These subtypes need to be tested in future studies in independent cohorts for practical diagnostic and therapeutic utility.

In Step 2 Prioritization, we used a CFG approach to prioritize the candidate biomarkers identified in the discovery step (33% cutoff, internal score of ≥2 pt.) by using published literature evidence (genetic, gene expression and proteomic), from human and animal model studies, for involvement in mood disorders (Fig. 1E and Table S2). There were 6370 probesets (corresponding to 4960 unique genes) that had a total score (combined discovery score and prioritization CFG score) of 6 and above. These were carried forward to the validation step. This represents approximately a tenfold enrichment of the probesets on the Affymetrix array.

In Step 3 Validation, we validated these prioritized biomarkers for change in clinically severe mood disorders (depression, mania), in a demographically matched cohort (n = 30 clinically severe depression, n = 17 clinically severe mania), by assessing which markers were stepwise changed in expression: from clinically severe depression in the validation cohort, to low mood in the discovery cohort, to high mood in the discovery cohort, and to clinically severe mania in the validation cohort (Fig. 1). Four thousand six hundred thirty-three probesets were not stepwise changed, and 1737 were stepwise changed. Of these, 291 probesets (corresponding to 283 unique genes) were nominally significant. This represents approximately a 188-fold enrichment of the probesets on the Affymetrix array.

Adding the scores from the first three steps into an overall CFE score (Fig. 1), we ended up with a list of 26 top candidate biomarkers (n = 23 genes, 26 probesets) that had a CFE score as good as or better than SLC6A4, which serves as a de facto positive control and which we decided to use as an empirical cutoff. This represents a more than 2000-fold enrichment of the probesets on the Affymetrix array.
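The enrichment figures quoted for each step appear consistent with a simple ratio: the 54,625 probesets on the array divided by the number of probesets retained at that step. The short Python check below reproduces that arithmetic from the counts reported in the text; reading "fold enrichment" as this ratio is an inference, and the variable names are illustrative.

```python
ARRAY_PROBESETS = 54_625  # probesets on the Affymetrix array, as reported

# Probesets retained after each step, as reported in the text.
retained = {
    "Step 1 Discovery": 11_620,
    "Step 2 Prioritization": 6_370,
    "Step 3 Validation": 291,
    "Top candidates": 26,
}

# Fold enrichment read as (array probesets) / (probesets retained).
fold = {step: ARRAY_PROBESETS / n for step, n in retained.items()}

for step, value in fold.items():
    print(f"{step}: {value:.1f}-fold")

# Consistency check on Step 3: probesets not stepwise changed plus those
# stepwise changed should equal the number carried forward from Step 2.
assert 4_633 + 1_737 == retained["Step 2 Prioritization"]
```

The ratios come out near 4.7, 8.6, 188, and 2101, which matches the rounded values in the text ("approximately fivefold", "approximately tenfold", "188-fold", and "over 2000-fold").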

The list of 23 genes (26 probesets) (Fig. 1) is composed of genes increased in expression in high mood (TMEM161B, GLO1, PRPS1, SMAD7, ANK3, OGT, CD47, GLS, TMEM106B, RPL3, FANCF, HNRNPDL, DOCK10, CALM1), and genes decreased in expression in high mood (NRG1, OLFM1, SPECC1, SORT1, TPH1, GSK3B, MARCKS, NR3C1, and SLC6A4). These 26 top candidate biomarkers were carried forward into analyses for understanding biological underpinnings. Last but not least, they were tested for predictive ability and clinical utility in additional independent cohorts.