Introduction

The treatment of bipolar depression is a major clinical challenge and no conventional antidepressants have been approved by the FDA for the short-term treatment of bipolar depression1. Only three pharmacotherapies are FDA-approved for bipolar depression; the combination of olanzapine and fluoxetine (OF), quetiapine monotherapy (QTP), and lurasidone as a monotherapy or adjunctive therapy. These treatments produce numbers needed to treat (NNT) for response of 4 for OF, 6 for QTP, and 5–7 for lurasidone, respectively2, and all three produce significant side-effects with numbers need to harm (NNH) of 6 for OF (e.g., weight gain), 5 for QTP (e.g., sedation), and 15–16 for lurasidone (e.g., akathisia and nausea), respectively2. Commonly used agents which do not have a labeled indication for treatment of the depressed phase of bipolar disorder, such as lamotrigine (NNT = 12; NNH = 37) and lithium (NNT = 15; NNH = 38), are modestly efficacious2, 3. Thus, new classes of medication are needed.

Given that an inflammatory-like state exists in a subgroup of patients with BD, there is increasing interest in the therapeutic potential of immune-modulating medications4,5,6,7. Two particularly promising candidates are minocycline and low-dose aspirin, as reviewed in the initial published protocol8 and summarized here: both medications are well-tolerated, even with long-term use, well absorbed and brain penetrant, and likely exert anti-inflammatory effects in the brain and the periphery.

Aspirin inhibits cyclooxygenase-1 (COX-1) and acetylates COX-2, blocking the conversion of arachidonic acid to prostaglandins and thromboxane A2. At the low dose used in this study, aspirin preferentially inhibits COX-1 while at higher doses it additionally reduces COX-2 function. COX-1 is predominantly expressed by microglia and macrophages while COX-2 is predominantly localized to neurons9. Importantly, preclinical evidence suggests that inhibition of COX-1 is neuroprotective after intracerebroventricular administration of lipopolysaccharide (LPS) whereas inhibition of COX-2 is detrimental, increasing leukocyte recruitment into the brain and exacerbating tissue damage10. Notably, a pharmacoepidemiological study demonstrated that individuals treated with lithium and low dose aspirin (≤80 mg/day) were less likely to have a medication event (medication switch or dose change) whereas high-dose aspirin, non-selective NSAIDs, and glucocorticoids were associated with an increase in medication events11. While there are no published controlled clinical trials of aspirin as an antidepressant, an open-label study reported aspirin increased the speed of response to SSRIs12, and an epidemiological study reported that aspirin protected against depression in older men with elevated levels of homocysteine13.

Minocycline, a tetracycline antibiotic, exerts a variety of biological actions that are independent of its anti-microbial activity, such as anti-inflammatory and anti-apoptotic activities, and inhibition of proteolysis, angiogenesis, and tumor metastasis. Minocycline modulates immune function via multiple mechanisms—for instance, inhibiting the activation, migration, and/or proliferation of T-cells, neutrophils, and microglia, inhibiting the release of pro-inflammatory cytokines, and increasing the release of anti-inflammatory and anti-apoptotic molecules14. In preclinical models, minocycline exerts neuroprotective effects, including reversing inflammation-induced inhibition of neural stem cell proliferation15 and reducing lesion size and/or demyelination in neurological disorders, such as Huntington’s disease16. Antidepressant-like effects have been reported in rodents17 and anecdotally in humans18, 19, as well as in an open-label trial for bipolar depression 20. In addition, a double-blind, placebo-controlled, randomized trial of minocycline for mild-to-moderate depression in HIV patients showed therapeutic effects21. No randomized controlled trial has been published in patients with primary mood disorders.

Here we perform the first randomized, placebo-controlled trial of aspirin and minocycline to assess their efficacy as adjunctive treatments for bipolar depression alone and together. Given the well-established safety profiles of these drugs, a 2 × 2 design was chosen for study as it permitted an opportunity to test two mechanistically different anti-inflammatory agents. Second, it allowed an opportunity to test for whether these mechanisms augmented or interfered with each other. If either aspirin or minocycline proved to be ineffective while the other agent was effective and both were well tolerated, then the two cells with the effective agent (i.e., effective agent plus placebo and effective agent plus the ineffective agent) could be combined and compared to the double placebo group combined with group which received the ineffective drug plus placebo, essentially doubling the statistical power of the study. On the other hand, if the two drugs were found to either augment or interfere with each other, then combining the groups to increase power would not be possible but valuable clinical information would be obtained.

Methods

Study design

This was a multi-site, double-blind, placebo-controlled clinical trial in which as adjunctive therapy to existing treatment, participants initially were randomized (1:1:1:1) to one of 4 groups: (a) active minocycline (100 mg b.i.d., p.o.) + active aspirin (81 mg b.i.d., p.o.) (M + A); (b) active minocycline (100 mg b.i.d., p.o.) + placebo aspirin (M + P); (c) placebo minocycline + active aspirin (81 mg b.i.d., p.o.) (A + P), and (d) placebo minocycline + placebo aspirin (P + P). At the mid-way point when 60 patients had completed the study, an adaptive design approach was taken to perform a blind interim analysis to test for futility. This blind interim analysis, performed by a statistician, found that two groups appeared to be separating from each other. A new power calculation was performed using the data from these two groups (i.e., the potential effect size and variability) and determined that 30 participants in these two cells would yield sufficient power to test for a difference in their respective response rates. In contrast, sufficient power was not available to detect group differences in the mean change in MADRS scores over time. Subsequent to the interim analysis, WCD, who did not assess participants and was not at any of the sites at this point in the study, was unblinded and observed that the M + A group was separating from the P + P group. This information was not conveyed to the clinical teams conducting the study. In addition, at this point in the study, it was apparent that we could only enroll 100 participants rather than the original goal of 120 and stay within our study budget. For these reasons, the design was adapted by stopping future enrollment into the A + P and M + P groups and instead randomization of all future participants (projected to be 36 based on the historical rate of participant accrual in the study) was to the M + A or P + P groups, resulting in the numbers per group included in the final statistical analyses of: 30 (M + A), 18 (M + P), 19 (A + P), and 28 (P + P). See CONSORT diagram (Fig. 1). The reason the projected number of additional subjects was 36 rather than 40 is because four more subjects (i.e., a total of 64) were already enrolled in the study but had not completed the study when the interim analysis was performed.

Fig. 1
figure 1

CONSORT flow diagram showing the number of individuals assessed for eligibility in person, the number of participants randomized to each group, the number of individuals lost to follow-up at visit 7 (week 6) and the number of individuals included in the statistical analyses

Parenthetically, adaptive trial designs allow for the modification of aspects of a study in a real-time, data-driven manner, and thus can greatly enhance the efficacy of early drug development22. The adaptive design is usually employed in early studies to test either many doses of a drug or different combinations or schedules of administration (e.g., oncology), and then is followed by at least one confirmatory conventional study for a new drug application to a regulatory agency, such as the Food and Drug Administration.

The duration of the trial was 6 weeks and comprised 7 visits (baseline and weekly follow-up, Figure S1). The study was conducted at three sites: the Laureate Institute for Brain Research (LIBR) in Tulsa, the University of Oklahoma College of Medicine (OU), Tulsa, and the University of Kansas School of Medicine (KUSM-W) in Wichita. The Western IRB was used as the central IRB for the three sites and this use was approved by the IRBs of OU and KUSM-W. The study protocol was published prior to the trial8.

Participants

Participants were recruited from psychiatric clinics associated with the study sites, as well as the general community through radio and print advertisement. The treatment that the participants received prior to study enrollment was determined by their treating clinician.

Inclusion criteria

(a) meeting DSM-IV-TR criteria for BD I (n = 37), BD II (n = 57), or BD NOS (n = 5) based on the MINI-Plus and a clinical interview by a study psychiatrist; (b) current major depressive episode of ≥4 weeks duration, (c) at least moderately depressed, i.e., a Quick Inventory of Depressive Symptomatology (QID-C16) score ≥10; (d) stable regimen of medication for ≥4 weeks prior to enrollment (if receiving therapy), (e) 18–65 years of age. Exclusion criteria are detailed in the supplement and a summary of the psychiatric medications taken by the participants during the trial appears in Table S1.

The study was approved by the western IRB and participants provided written informed consent. The trial was publically registered (www.clinicaltrials.gov): NCT01429272. Minor amendments to the published pre-study protocol are as follows: (a) The threshold for depression severity originally was a score of at least 18 on the HAM-D 17. We switched from the HAM-D to the QID-C 16 because of its better parametric characteristics but the severity threshold remained identical: a score of 10 on the QID-C 16 is equivalent to an 18 on the HAM-D 17. (b) The proposed age range of 18–55 was altered to 18–65 to facilitate recruitment. (c) Similarly, in order to facilitate recruitment, we broadened the entrance criteria to allow for the inclusion of participants with a diagnosis of BD NOS. (d) A blind interim analysis was performed leading to the dropping of two arms from the study. (e) We used more rigorous criteria for response and remission than was specified in the protocol. The MADRS-based criteria for response (>50% decrease in MADRS scores) and remission (MADRS score of <11) were retained as these are standardly used in the field23. However, in the protocol we stated that these criteria would apply at the final visit of the participant whereas here we required the 50% decrease in MADRS and/or the MADRS score of <11 to be present for the final two consecutive visits in line with recent FDA advice. (f) Time of blood draw was not used as a covariate in the immune analyses because this information was not available for all participants.

To assess and enhance participant adherence to study medication, participants were given an information sheet to take home detailing the procedure to be followed in the case of a missed dose, and requesting that this information be recorded for the investigators. Returned trial medication was audited. Participants were permitted to miss no >50% of their study medications in a single week on more than one occasion. Non-adherent participants received additional counseling about the need for adherence beyond what was routinely given each week. A second episode of non-adherence would have resulted in withdrawal from the study but did not occur. We also obtained a post-study measure of medication adherence for the participants receiving aspirin, i.e., urinary concentrations of thromboxane B2 (11-D-TXB2), a downstream metabolite of prostaglandin H2, that is robustly decreased by treatment with aspirin24.

Randomization and masking

Participants were randomly assigned to one of the four arms in a double-blind fashion via a central database hosted at LIBR. Randomization was conducted according to CONSORT guidelines using block permutations. Randomization codes were generated by a computer-based random number generator and were held by LIBR staff not involved in the trial until study completion. The investigators and participants were blind to the treatment allocation. Blindness was maintained by ensuring that the packaging, appearance and color of the minocycline, aspirin, and placebo capsules were identical. Commercial minocycline and aspirin tablets were purchased by the manufacturer (Wedgewood Pharmacy, Swedesboro, NJ) and over-encapsulated. Matching placebo capsules containing lactose were produced.

Biological samples

Serum samples were obtained at baseline (n = 90) and visit 7 (n = 80) to measure inflammatory biomarkers. CRP and interleukin-6 (IL-6), were measured using a Meso Scale Discovery (MSD) QuickPlex SQ 120 instrument and MSD V-PLEX assays (CRP: LLOQ = 0.05 mg/L; CV = 1.9%; IL-6: LLOQ = 0.04 pg/mL; CV = 3.6%). A spot morning urine was taken and a 11-Dehydro Thromboxane B2 EIA kit (Cayman Chemical Company; Ann Arbor, MI) used to quantify 11-D-TXB2 (LLOQ = 46 pg/mL; CV = 4.3%), an indirect marker of COX activity that has been used to measure adherence to aspirin therapy in cardiovascular and cerebrovascular diseases24. For purposes of comparison, we also obtained two 11-D-TXB2 measures from a healthy control sample (n = 27), 6 weeks apart.

Outcome measures

Based on the results of the interim analysis (see Study Design above), two of three original primary outcome measures, i.e., the change in MADRS scores across visits and remission rate were reduced to exploratory outcome measures. The third original primary outcome variable, i.e., response to treatment was maintained. Sustained response to treatment was defined as a >50% reduction23 in Montgomery–Äsberg Depression Rating Scale (MADRS) for the final two consecutive visits of each participant25. The exploratory outcome measure, remission, was defined as a post-treatment MADRS score of <1123 for the final two consecutive visits25. Other exploratory outcome measures were the mean change in MADRS score, Clinical Global Impression Improvement (CGI-I) score, and Hamilton Anxiety Scale (HAM-A) score over time.

Adverse events were recorded at each visit and were posed as open-ended questions about any issues with the trial or the study medication in accordance with standard FDA guidance for the execution of such clinical trials (Table S2). Safety was monitored by the study PI, a biweekly consensus meeting of the principal investigator (PI) and study staff at each site, and a Data, Safety and Monitoring Board (DSMB) which met biannually.

Hypotheses

Primary hypothesis

Hypothesis 1:

Participants in the M + A group will show a greater sustained response rate than participants in the P + P group.

Secondary hypotheses

Hypothesis 2:

Across all four groups, participants receiving aspirin and/or minocycline will show a greater response rate than participants in the P + P group.

Hypothesis 3:

The efficacy of aspirin and/or minocycline treatment will be predicted by baseline inflammation such that patients receiving active treatment with higher levels of IL-6 and CRP will show a greater response rate to treatment. Additionally, participants who show a greater decrease in IL-6 and CRP concentrations between visits 1 and 7 will show a greater response to treatment.

Exploratory analyses (EA)

(EA 1). Compared to the P + P group, patients receiving aspirin and/or minocycline will show a greater remission rate.

(EA 2). Compared to the P + P group, patients receiving aspirin and/or minocycline will show a greater decrease in mean MADRS scores over time.

(EA 3). Compared to the P + P group, patients receiving aspirin and/or minocycline will show a greater decrease in CGI-I scores over time.

(EA 4). Compared to the P+P group, patients receiving aspirin and/or minocycline will show a greater decrease in mean HAM-A scores over time.

Statistical analysis

Analyses were performed with R26. Analysis of Variance (ANOVA) or the χ2-test was performed to test for baseline differences in demographic and clinical variables across the four treatment arms. Concurrent medications were coded into four classes, i.e., antidepressants, anticonvulsants, antipsychotics, and anxiolytics (Table S1). There was no significant difference between groups in numbers of participants per medication class, age, sex, and body mass index (BMI). Results also did not differ significantly across sites.

Between group differences in response rate were tested with logistic regression with the analysis of deviance test. To evaluate the effects of baseline inflammation on response rate, IL-6 and CRP were entered as additional binary (i.e., low vs. high based on a median split) variables. Variables were selected as regressors according to whether their inclusion improved the model measured by the Akaike information criterion (AIC) score. Based on the AIC score, no covariates (i.e., age, sex, body mass index (BMI), medication class nor study site) were included in the models as covariate regressors for the logistic regression analyses.

For the post hoc analyses, differences in remission rate were tested with logistic regression with the analysis of deviance test. Linear Mixed Effect (LME) model analyses were performed to measure the temporal change in mood ratings (MADRS, CGI-I, and HAM-A scores) across visits. Age, sex, and BMI were included as fixed effects based on the AIC. Random effects included subject and autoregressive covariance structure across visits within subject (see Supplement).

For the primary and secondary hypotheses, we used a standard statistical significance threshold of p < 0.05. For the exploratory analyses, we used a Bonferroni-corrected statistical threshold of p < 0.013 (to correct for four tests).

Role of the funding source

The funders of the study had no role in the study design, data collection, data analysis, data interpretation, or writing of the report. The corresponding authors had full access to all the data in the study and had final responsibility for the decision to submit for publication.

Results

Ninety-nine participants were randomized out of a total of 201 individuals who underwent in-person screening (Fig. 1). The recruitment period was from March 2012 to August 2015. Initially, we believed 100 participants had been randomized and hence closed the study; however, one person was erroneously counted twice. Seventy-five participants completed all visits. Four participants did not complete any post-baseline visits and were excluded from the analyses. The difference in urine 11-D-TXB2 concentrations between baseline and visit 7 demonstrates target engagement of COX by aspirin in the groups that received treatment with aspirin (Figure S2). Baseline characteristics of the sample and the number of participants who completed each visit are shown in Tables 1, 2, respectively.

Table 1 Demographic and clinical characteristics of the M + A, M + P, A + P, and P + P groups at baseline
Table 2 MADRS scores and inflammatory biomarker concentrations at visits 1–7 (Mean ± SD)

One participant in the M + A group developed a hypomanic episode during the study and was withdrawn. Nevertheless, there was no difference between groups in the Young Mania Rating Scale (YMRS) scores at visit 7 (F3,71 = 1.44, p = 0.237). Parenthetically, the YMRS was used to prospectively test for any evidence that minocycline and/or aspirin could precipitate cycling into hypomania or mania. The dropout rate did not differ significantly across groups (χ23 = 0.83, p = 0.842). There was no significant difference across the four arms in the number of participants who were being treated with antipsychotics (χ23 = 0.41, p = 0.932), mood stabilizers (χ23 = 0.45, p = 0.928), antidepressants (χ23 = 0.87, p = 0.832), or anxiolytics (χ23 = 2.01, p = 0.530) at study entry. Further, there was no difference between groups in the number of individuals with a diagnosis of BD I, BD II, and BD NOS (χ26 = 5.54, p = 0.476). The adverse events per group appear in Table S2. The main results of the study are summarized in Table S3.

Hypothesis 1

Participants receiving minocycline plus aspirin had a better response rate compared with participants receiving double placebo (χ12 = 3.35, p(1t) = 0.034, odds ratio (OR) = 2.93, 95% confidence interval (CI) = 0.93–10.08, NNT = 4.7). When pre- vs. post interim analysis status was used as a covariate in the logistic regression, the M + A group still showed a significantly better response rate than the P + P over the entire trial (χ12 = 3.35, p(1t) = 0.034). There was no main effect of pre- vs. post interim analysis status (χ12 = 0.002, p = 0.961) or interaction between pre- vs. post interim analysis status and study arm (χ12 = 1.35, p = 0.246). Based on these two analyses, no bias in the results was introduced by the interim analysis.

Hypothesis 2

There was a significant main effect of aspirin on the clinical response rate (χ12 = 5.52, p(2t) = 0.019 with analysis of deviance test, (OR = 3.67, 95% CI = 1.03–14.06) but no significant effect of minocycline (χ12 = 0.01, p = 0.911) or interaction between aspirin and minocycline (χ12 = 0.19, p = 0.659) (Fig. 2). The NNT to obtain a response to aspirin (M + A and A + P vs. M + P and P + P) was 4.2. The NNT for the M + A and A + P vs. the P + P comparison was 4.0.

Fig. 2: Percentage of responders (y-axis) in each of the 4 treatment groups shown individually (top panel) and the two aspirin groups (M + A and A + P) vs. the two non-aspirin groups (M + P and P + P) (bottom panel)
figure 2

Top panel: participants receiving minocycline plus aspirin showed a better response rate compared with participants receiving double placebo (χ12 = 3.35, p(1t) = 0.034, odds ratio (OR) = 2.93, NNT = 4.7). Bottom panel: there was a significant main effect of aspirin on the clinical response rate (χ12 = 5.52, p(2t) = 0.019, OR = 3.67) but no significant effect of minocycline (χ12 = 0.01, p = 0.911) or interaction between aspirin and minocycline (χ12 = 0.19, p = 0.659). The NNT to obtain a response to aspirin (M + A and A + P vs. M + P and P + P) was 4.2. The NNT for the M + A and A + P vs. the P + P comparison was 4.0. #p < 0.05 (one-tailed test); *p < 0.05 (two-tailed test)

Hypothesis 3

There was a significant 3-way interaction between aspirin, minocycline, and IL-6 (χ21 = 7.08, p(2t) = 0.008) on response rates. Follow-up analysis showed that participants in the M + P group with higher baseline IL-6 levels responded better to minocycline than participants in the M + P group with lower IL-6 concentrations (χ21 = 7.72, p(2t) = 0.005, Fig. 3). Further, there was a significant interaction between the change in IL-6 concentrations and treatment response (χ23 = 9.69, p(2t) = 0.001). Follow-up testing indicated that participants in the M + P group who responded to treatment had a significantly greater decrease in IL-6 levels between baseline and visit 7 compared with non-responders in the M + P group. There were no significant interactions between aspirin and CRP or minocycline and CRP, or the 3-way interaction between aspirin, minocycline, and CRP (all p’s > 0.1).

Fig. 3: Box-and-whisker plots showing the 4 treatment groups divided into responders (green) and non-responders (blue)
figure 3

The box shows inter quartile range (IQR, i.e., 25–75%) around the median (line in a box) and the whisker shows data range (minimum to maximum values) except for outliers (diamonds). The natural logarithm of the serum IL-6 protein concentration is shown on the y-axis. There was a significant 3-way interaction between aspirin, minocycline, and IL-6 (χ21 = 7.08, p(2t) = 0.008) on response rates. Follow-up analysis showed that participants in the M + P group with higher baseline IL-6 levels were more likely to be classified as treatment responders than participants in the M + P group with lower IL-6 levels (χ21 = 7.72, p(2t) = 0.005). **p < 0.01 (two-tailed test);  = outlier, defined as a data point outside of 75% + 1.5*IQR or 25%–1.5*IQR

Post hoc analyses

There was a main effect for aspirin on the remission rate that did not remain significant after Bonferroni correction (χ12 = 4.14, q(2t) > 0.5, p(2t) = 0.042, uncorrected, OR = 2.52, CI = 0.56–12.29, Figure S3). There was no main effect for minocycline (χ12 = 0.45, p = 0.503, uncorrected) or the interaction between aspirin and minocycline (χ12 = 0.35, p = 0.554, uncorrected). The NNT for aspirin to obtain remission (M + A and A + P vs. M + P and P + P) was 6.5. The NNT for the M + A and A + P vs. P + P comparison was 8.0. There were no significant main effects of aspirin or minocycline on the mean change in MADRS, CGI-I, or HAM-A scores from baseline, nor were there significant interactions between aspirin, minocycline, and visit (Figures S4S6).

Discussion

This is the first randomized controlled trial of adjunctive minocycline and/or aspirin for the treatment of bipolar depression. There were two principal findings.

First, consistent with our findings from the interim analysis, there was a significantly greater sustained clinical response rate in the M + A group (44%) vs. the P + P group (21%, NNT = 4.7) with participants receiving minocycline plus aspirin approximately twice as likely to show a sustained response vs. those individuals receiving double placebo. For this specific analysis, we performed a one-tailed test because of our a priori hypothesis (based on the interim analysis) that the M + A group would show a greater response rate than the P + P group. However, irrespective of whether a one or two-tailed test is performed, the large effect size (OR = 2.93) raises the possibility that the combination of minocycline and aspirin may be an efficacious adjunctive treatment for bipolar depression. Further studies are needed to test whether the combination of minocycline and aspirin exerts a synergistic effect that is superior to either drug alone.

Given the initial 2 × 2 design, by combining groups as proposed in the original published protocol, we also could test for the main effects of the aspirin alone (i.e., A + P) or together with minocycline (i.e., M + A) vs. P with or without minocycline (i.e., P + P and/or M + P). Similarly, we also could test for the main effects of the minocycline alone (i.e., M + P) or together with aspirin (i.e., M + A) vs. P with or without aspirin (i.e., P + P and/or A + P). When all four treatment arms were included in the analysis (hypothesis 2), there was a significant main effect of aspirin (A + P and M + A combined) on sustained response rates with NNTs of between 4 (for A + P and M + A vs. P + P) and 4.2 (for A + P and M + A vs. P + P and M + P), i.e., equal to or superior to currently approved treatments for bipolar depression.

The second main finding was that participants with higher baseline levels of IL-6 responded better to minocycline administration than patients with lower levels of inflammation (hypothesis 3). These data resemble findings from a recent clinical trial of the TNF inhibitor, infliximab, for treatment resistant depression in which no overall difference in the change in depression ratings was detected between treatment groups across time27. However, infliximab-treated patients with a baseline CRP concentration >5 mg/L had a greater decrease in depression ratings compared to the placebo group27. The results from both the infliximab and the current study suggest that anti-inflammatory agents may not be helpful for treating depressed patients who do not manifest inflammation. In addition, the minocycline-treated participants who showed a greater decrease in IL-6 levels between baseline and visit 7, also showed a larger reduction in depressive symptoms over the trial. This result is consistent with recent studies reporting that depressed patients who respond/remit to treatment with an SSRI or electroconvulsive therapy show a greater decrease in IL-6 levels over time compared with non-responders or remitters28, 29. Conceivably, IL-6 could be used in future to predict which patients are likely to respond to minocycline.

The absence of a significant interaction between the efficacy of aspirin treatment and baseline levels of CRP and/or IL-6 may reflect Type II error given the relatively small samples, but also may reflect the clinically non-significant anti-inflammatory effect of low-dose aspirin in autoimmune or other inflammatory disorders. Our results therefore suggest that the therapeutic effect of aspirin may be attributable to a still unknown mechanism, conceivably the effect of COX-1 inhibition on the arachidonic acid cascade30 or neurotrophic processes31. Arachidonic acid not only is released by unregulated phospholipase A2 activation associated with neuroinflammation, leading to excess production of COX-mediated inflammatory metabolites, but also is a second messenger released during neurotransmission via dopaminergic (D2), glutamatergic (NMDA), serotonergic (5-HT2A, 5-HT2C), and muscarinic (M1, M3, and M5) receptors, raising the possibility that inhibition of COX by aspirin may modulate disturbed neurotransmission in bipolar depression30. In this regard, this first report of higher baseline 11-D-TXB2 levels in the BD sample relative to the control sample (Figure S2) is noteworthy as it suggests that the activity of the arachidonic acid pathway is elevated in a subset of individuals with BD, consistent with previous hypotheses30.

Strengths of the study include: (a) the placebo-controlled assessment of two novel therapeutic agents, with the 2 × 2 design allowing us to evaluate potential additive and antagonistic drug interactions, (b) the use of peripheral blood biomarkers to assess the effect of inflammation on treatment outcome, (c) target engagement of aspirin (based on the pharmacodynamic effects of COX inhibition) in the aspirin groups, and (d) the representative nature of our sample for individuals with bipolar depression, many of whom were inadequately responsive to existing treatments, which remains a serious unmet medical need.

Although we attempted to assess the effects of adjunctive medications by broadly grouping medications by class, we did not control for differences in dose, pharmacokinetics, and drug–drug interactions. This is a limitation of the study.

In sum, the study provides preliminary support to the possibility that aspirin and minocycline can be efficacious adjunctive therapies for the treatment of bipolar depression. These findings should encourage further studies in larger samples perhaps using markers of inflammation as inclusion criteria to increase statistical power. Independent confirmation of the therapeutic efficacy of low-dose aspirin has significant potential to advance the treatment of depression given its global availability and affordability, and its relatively benign side-effect profile.