Introduction

Depressive episodes are characteristic for both, major depressive disorder (MDD) and bipolar disorder (BD)1,2,3,4. Because the diagnostic criteria for depressive episodes are the same in both disorders5,6, establishing the correct diagnosis is difficult yet important as each requires different treatment1,7,8.

Several clinical characteristics and biological markers have been shown to increase the probability of diagnosing BD rather than MDD9,10,11, but none of them is included in diagnostic criteria, nor generally accepted without further exploration. The presence of mania in medical history remains the only accepted diagnostic marker that differentiates the two disorders. In addition to often unavailable or inconclusive psychiatric history, the diagnostic process is even more difficult given that episodes of BD depression compared to episodes of mania are longer, more frequent, and predominantly present at the beginning of BD12,13,14. Misdiagnosis is common and presents a dire problem: up to 40–60% of bipolar patients are misdiagnosed as unipolar and only 20% receive the correct diagnosis within the first year which may considerably increase risks of inappropriate drug prescription, switching to mania, prolonged illness duration, risk of recurrence, suicide and overall poorer treatment responses7,15,16,17,18. Hence, biologically relevant diagnostic markers of BD and MDD depressions would significantly improve the diagnostic process, yet are still unavailable.

While the pathological processes in psychiatric disorders (e.g. MDD and BD) are mostly unknown and the diagnosis is not data-driven and remains a clinical decision, the field of neuroimaging may contribute considerably in identifying biomarkers19. If found, rapidly growing advances in technology and data analysis may facilitate the utility of such biomarkers on levels of specificity and sensitivity that is relevant even for clinical use.

Resting-state functional magnetic resonance imaging (rs-fMRI) is a neuroimaging modality that focuses on low-frequency spontaneous fluctuations of blood oxygen level-dependent (BOLD) signal20,21. As opposed to the paradigm or task-based fMRI, in rs-fMRI the participants are usually asked to lay still in the scanner, with eyes closed or fixated on a cross-hair, and do nothing in particular. Depending on the used method of analysis, rs-fMRI enables to examine patterns of neural activity and connectivity at rest22. Rs-fMRI has a high potential for clinical use as it is non-invasive, safe, without task demands on the participants, relatively short, and well-tolerated even by severely depressed individuals23.

Presuming depressive episodes manifest with different activity in neurocircuitry in BD and MDD, these should be reflected in rs-fMRI scans. Numerous studies attempted to use rs-fMRI and various approaches to data analysis to find resting-state differences between patients with bipolar and unipolar depression. Mostly dorsolateral prefrontal, limbic, and midline structures were reported as differentiating the disorders in various measures of resting-state3,24, but no conclusive convergence of the results was reported yet. This inconsistency may arise from the diversity of methods used in the analyses, and the heterogeneity of the compared samples, which often differ across studies in size, clinical state, and comorbidities. Also, in fMRI studies, the cluster-wise inference (CWI) approach is often used in statistical analyses. This may present a problem as it was recently suggested that CWI in parametric statistical methods may lead to inflated false-positive rates, especially when lenient or uncorrected cluster-defining threshold is used25. Thus, heterogeneity may well result from false positives findings.

A quantitative meta-analytical approach may compensate for several of the aforementioned issues. It can attenuate the false-positive rates and the problem of sample comparability in terms of size and psychopathological variability26,27. To further enhance comparability across the studies, current psychosis should also be an exclusion criterion being more prevalent in BD compared to MDD28. In addition, psychotic mood episodes may be associated with additional brain abnormalities relative to nonpsychotic episodes and another confounding variable would have to be addressed. Finally, considering relatively indistinct diagnostic boundaries between bipolar disorder II and MDD, and metabolism29 and genetic30,31 studies suggesting a neurobiological difference between BD I and II, rather the narrow phenotype of BD I should be preferably investigated.

The uniformity of analytical methods is another challenging problem. Namely, there are only a few studies that compared depressive episodes in MDD and BD and used identical analytical methods. One way to overcome this is to focus on spatial information, e.g. brain-regions or their anatomical coordinates, of the reported differences in resting-state and include rs-fMRI studies regardless of the used analytical method32,33,34,35,36,37. However, this still limits the inclusion process to studies that do not restrict their rs-fMRI scans and following analysis to any particular brain regions. This approach also does not provide information on the nature of these differences such as functional connectivity or revealing functional networks. Instead, it provides identification of brain regions that have a high probability to contain information relevant for distinguishing BD from MDD depression.

The present meta-analysis aims to identify converging spatial information associated with alterations in spontaneous brain activity that differentiate patients with BD and MDD depression, using activation likelihood estimation (ALE)36. To enhance the validity of the results, the meta-analysis focuses on whole-brain rs-fMRI studies, with currently depressed, non-psychotic BD and MDD participants. If found, identified brain regions may be targeted for testing of new hypotheses and the development of neural biomarkers for the diagnosis of BD and MDD.

Results

The inclusion process resulted in ten eligible studies38,39,40,41,42,43,44,45,46,47 with nine samples (two studies were performed on the same sample40,41) with a total of 234 BD and 296 MDD subjects. From the included studies, 7 experiments with 16 foci were extracted for the BD > MDD contrast and 6 experiments with 13 foci were extracted for the MDD > BD contrast.

Pooled analysis weighted by sample size of each study revealed only trend to younger age (BD: meanpooled ± SDpooled 30.4 ± 10.3 years; MDD: 32.5 ± 11.2 years; t = − 1.64, p = 0.10), and lower female proportion (BD: 54.1 ± 9.9%; MDD: 61.8 ± 8.8%; χ2 = 3.18, p = 0.07), but lower severity of depression (HAMD; BD: 23.8 ± 7.9; MDD: 25.5 ± 7.2; t = − 2.11, p = 0.03) in BD compare to MDD participants. Two studies included medication-free participants, while in two studies medication status was not specified. In remaining studies (n = 5; BD = 150, MDD = 194), comparable use of antidepressants (BD 38%, MDD 46%, p = 0.14), but higher rate of antipsychotics (BD 26%, MDD 4%, p ˂ 0.001), lithium (BD 12%, MDD 0%, p ˂ 0.001), and other mood stabilizers (BD 32%, MDD 2%, p ˂ 0.001) was found in BD patients.

All included studies had an upper-medium to high quality. The summary of the studies is detailed in Table 1.

Table 1 Studies included in the meta-analysis, methods of analysis, demographics, clinical data, medication, between-group contrasts, number of foci, main study outcome and study quality.

For the BD > MDD comparison, the ALE meta-analyses did not find any significant converging clusters (Table 2). For the MDD > BD comparison, the ALE meta-analysis identified a cluster covering the left insula and adjacent area overlapping the left claustrum (Table 2, Fig. 1).

Table 2 Meta-analysis results of the resting state activity differences in subjects with MDD compared to BD in both directions.
Figure 1
figure 1

Axial and coronal view displaying the cluster of increased activity in unipolar depression compared to bipolar depression. Red to yellow: significant meta-analytic results (p < 0.05).

Discussion

This ALE meta-analysis found consistently higher resting-state activity in left insula and adjacent area covering the claustrum in currently depressed MDD subjects relative to currently depressed BD subjects. No increased activity was found in BD subjects compared to MDD. The main implication of this finding is that the left insula may be involved in neural activity patterns during the resting state that can be potentially used as biomarkers differentiating both disorders.

In this study, only whole-brain rs-fMRI studies unrestricted to pre-selected brain-regions with currently depressed, nonpsychotic, and BD I and MDD participants were included. This led to a drop in included studies, but also increased the overall validity of the results. Assuming the main heterogeneity of findings in included studies resulted from the variability in imaging acquisition method and statistical analyses, ALE focused on spatial convergence was found to be the best meta-analytical option for this type of study25,34.

Cluster level FWE correction was applied in the meta-analysis which is regarded as the most appropriate method for statistical inference when using ALE34. In individual comparative studies, CWI is considered to be prone to false positive findings25. However, to use a voxel-wise inference in ALE would be too strict, especially when a rather small number of included studies was used. Therefore, to mitigate the CWI bias, in this ALE meta-analysis a recommended two-level thresholding with stricter thresholds was used32. First, on the voxel-level, the forming of clusters in the ALE image was thresholded on an uncorrected p < 0.0001. Second, the emergent clusters were tested on a threshold of p < 0.05 against 5000 permutations which were generated based on the original data sets. Thus, the surviving clusters should represent a true non-random convergence across the studies.

Next to the utility of the found regions as potential targets for future studies, the question arises what does the convergence of increased local activity in MDD relative to BD mean. ALE provides identification of non-random spatial convergence of included foci across the studies35. Higher activity in insula in MDD relative to BD as revealed by ALE may then represent an increased local neural activity, local over-connectivity, or increased connectivity with other regions of the brain. Furthermore, an increase in MDD relative to BD may not correspond to an increase relative to healthy subjects.

In support of our findings, there is emerging evidence of increased insular activity38,40,48,49 and connectivity50,51,52,53,54 in MDD relative to BD and controls. In addition, decreased regional metabolism in medication-free BD compared to controls was found in left dorsolateral and midline brain-regions, including the insula55. Moreover, while structural changes may not necessarily correspond with an increase or decrease in brain-region activity or connectivity, insular grey matter thickness was reported to be increased in MDD relative to controls56,57, and decreased in BD58,59 even before the onset of the illness. The decrease in grey matter volume in the insular and anterior cingulate cortex, both nodes of the salience network60, was also found in first-degree relatives of BD61. Reduction in gray matter volume in these structures, and potentially associated dysregulation of the salience network, may indicate an endophenotype of BD.

There are also studies that suggest opposite or mixed findings41,62,63,64,65. Nevertheless, they still indicate abnormalities located in the insula and structurally or functionally adjacent brain-regions and pathways (i.e. claustrum), which displayed differences in MDD relative to BD. Taken together, there is a growing body of evidence supporting that insular activity and connectivity is increased, or at least different, in resting-state MDD relative BD (and controls).

Increased activity and connectivity in insula in MDD may be interpreted in line with current knowledge about the functional properties of the structure and clinical observations. The insula is a multimodal integration region that evaluates the emotional and motivational salience of external and internal stimuli66,67,68. Typically, two functionally distinct regions are recognized in the insula68. First, visceral, somatosensory, vestibular, and motor inputs are relayed to the posterior insula. From there, they are forwarded to the second part, the anterior insula, where the re-representation of these inputs is integrated with emotional, cognitive, and motivational signals collected from cortical and subcortical regions. The anterior insula together with the anterior cingulate cortex and amygdala form the salience network (SN)60,69, and is also interconnected to the dorsolateral prefrontal cortex and ventral striatum66,67,68. Furthermore, the anterior insula has been entitled the “limbic sensory area” connected with the anterior cingulate cortex, the “limbic motor area”70,71,72. In other words, the anterior insula is associated with visceral sensation, and the anterior cingulate cortex is associated with autonomic and emotional control. Lastly, the salience network has been implicated in detecting both interoceptive and external salient changes, signal for recruitment of additional processing, orient attention toward and react to salient stimuli, and switching between inward-oriented (e.g. default mode network (DMN)73) and externally directed cognition (e.g. central executive network(CEN)60,69.

Following this, increased activity in insula and adjacent neurocircuitry may correspond to clinical observations in which MDD patients report, among other MDD symptoms, increased introspective self-focus, rumination, and preoccupation on bodily states (somatic-muscular, respiratory, gastrointestinal, genitourinary, and autonomic symptoms)53,74,75,76. BD patients, on the other hand, display overall inhibition, emotional dampening, heaviness, tiredness of the body, slower thinking, inner tension, and fearfulness74,75,76. In other words, MDD patients are more focused on the inside and preoccupied with self-focus on their inner world (rumination) and bodily symptoms, and BD patients are more inhibited, blunted, and fearful.

From the perspective of functional connectivity and networks, the observed increased self-focus on inner states corresponds with the increased activity within DMN found in both MDD and BD40,77,78,79,80. However, increased activity within the SN81 and increased connectivity between the SN (specifically insula) and DMN was found in MDD82,83,84, which suggests an increased input from visceral and bodily states in MDD. In BD, the connectivity within and from the SN seems to be impaired, suggesting a dysregulated salience associated with a dysregulation of emotional control85. A recent study on the connectivity dynamics in neural networks also found that the switching rate of DMN was decreased in both MDD and BD relative to controls, suggesting an inability to navigate away from internal emotional and cognitive states86. However, MDD displayed a lower switching rate in SN and striatum relative to healthy controls and BD, which corresponds with increased insular activity on one hand, and decreased regulation of DMN by SN on the other86.

Increased activity in the claustrum in MDD was a rather surprising finding. No included study reported this region as differentiating the two disorders. One explanation is that claustrum was a component of larger significant clusters in the included studies, but more prominent structures nearby, such as insula or putamen, were labeled instead. Moreover, automated anatomical labelling atlases used in neuroimaging only rarely include claustrum (see AAL v387). And conversely, it is possible that the true finding was insula and the claustrum was the result of an oversensitive atlas used in GingerALE software. Finally, the claustrum may be a result of a false-positive convergence of overlaps of surrounding significant structures. All these notions suggest caution in the interpretation of this finding.

The claustrum is still a poorly understood structure with high connectivity in the brain88,89,90,91. It is structurally and functionally connected with cortical and subcortical brain regions known to be compromised in neuropsychiatric disorders including the insula, anterior cingulate cortex, pre- and post-central gyrus, superior temporal gyrus, amygdala, and basal ganglia90,91,92,93; and likely acts as a relay node within several neural networks, which were proposed to be impaired in mental disorders94. Still, studies on claustrum in the context of neuropsychiatric disorders are sparse and its ontogenetic origin is still debated. From the sparse available research reduced volume95, hypoactivity78, and reduced metabolic activity96 in claustrum were reported in MDD; and reduced grey matter97 and increased metabolism98 was reported in BD. Currently, the claustrum is considered to belong to the insular cortex99, rather than the putamen and basal ganglia100. In this light, our findings suggest that distinct MDD and BD abnormalities in the claustrum are likely linked to altered insular functions and salience processing101.

The study has several limitations. The first cluster of limitations is related to the analyzed populations. MDD and BD differ in several characteristics that might affect the results but are difficult to control on both, the single-study level and the meta-analysis. Aggregated data showed different gender distribution and depression severity, and even age-comparable groups do not preclude dissimilar illness duration as the younger age at onset is more prevalent in BD. However, as MDD and BD groups were comparable in age, gender, and depression severity in all included studies, the difference on the whole-sample level likely had a limited impact on the ALE results.

Another issue that should be considered is the effect of different medications in MDD and BD. Two studies enrolled non-medicated participants, another two did not specify the medication status, but in the others, a higher rate of antipsychotics, lithium, and other mood stabilizers in BD groups was indicated. Especially antipsychotics are of relevance because they have been reported to lower or normalize insular activity in psychosis65,102,103. However, there are also studies showing opposite or no effect104. In addition, a former review has shown that in bipolar fMRI studies, both task-based and resting state, the antipsychotic medication had no altering effect on the results105. While the effect of the medication cannot be fully discarded without direct control, it is plausible to assume the lower insular activity in bipolar relative to unipolar depression is not a result of medication.

The next limitation is the small overall number of included experiments and the resulting sample sizes. This might have increased the likelihood that the ALE results were driven by a few experiments with a larger sample size34. Indeed, the largest study included by Yao et al.45 that represents one-fifth of the whole dataset (N = 131) is one of the two main contributors to the results; the second one is the study by Liu et al.40. And if either of the studies is removed from the dataset, the meta-analysis would yield no significant results. Importantly, other studies do contribute to the results as well, but the two studies are the leading contributors. Furthermore, Yao’s study is the only one with a mixed BD I and II sample and besides the sample size, this might be another issue to be considered.

The ALE method has limitations as well. One limitation is that ALE takes into account only reported foci, which may omit other significant regions within the reported clusters.

Another drawback of ALE is that studies with no reported significant findings are not accounted for in the calculation. There were studies in both contrast groups (BD > MDD and MDD > BD) that indeed did not report significant findings in both directions (see Table 1). This may have led to an overestimation of the findings.

The exclusion of ROI-to-Voxel or ROI-to-ROI studies, which was justified to accommodate ALE requirements, also presents a potential bias of non-reporting of significant results. On the other hand, the risk of bias by including such studies would render the results highly unreliable.

Finally, as stated above, ALE does only inform of non-random convergence of findings. This cannot fully dismiss the risk of identifying meaningless false-positive convergences (e.g. convergence in white matter, ventricles, etc.). In other words, it may pinpoint an insignificant region only because it was surrounded by true significant structures.

In conclusion, the current meta-analysis applied the ALE method to identify convergence of results of resting-state fMRI studies that compared MDD and BD currently depressed individuals. The results showed that the left insula and potentially claustrum might be involved in neural activity patterns during resting state that can be used as biomarkers differentiating both disorders. The finding is in line with clinical observations in which MDD patients display more pronounced symptoms (ruminative self-focus, focus on bodily symptoms) associated with abnormal functions of neurocircuitry involving insula and adjacent brain regions than BD patients. Future studies should confirm our findings on a larger sample, e.g. on more than ten datasets, and explore their sensitivity and specificity in differentiating both disorders.

Methods

Prior to the study initiation, the study was pre-registered on PROSPERO (https://www.crd.york.ac.uk/prospero/) with project ID CRD4201811443. The study was conducted in compliance with the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-analyses) guidelines for systematic review and met-analysis106.

Search strategy and selection of studies

A systematic search was conducted on the PubMed, Web of Science (WOS) and Scopus databases. A secondary search was conducted on the Google Scholar (GS) database. All searches were confined from January 2000 to August 2020 and performed by MP. In PubMed, WOS and Scopus titles, keywords, and abstract searches were conducted using the following terms: (bipolar OR bipolar disorder) AND (unipolar OR depression OR depressive episode OR major depressive disorder OR depressive disorder) AND (fMRI OR functional magnetic resonance) AND (rest OR resting state). In GS, a combination of the same keywords was used and the search was limited to the first 200 articles sorted by relevance. After removing duplicates, the systematic search in PubMed, WOS and Scopus identified 1508 studies. Two researchers (MP, ES) independently screened the titles, keywords, and abstracts of the studies for relevance and filtered out non-English, non-peer-reviewed, and unpublished articles. Sixty-two articles passed the initial screening of PubMed, WOS and Scopus studies and their reference lists were searched for additional potential studies with no new results. Screening of GS studies did not identify any additional articles. Two researchers (MP, ES) evaluated the full-texts of the 62 articles on the following criteria for study inclusion: (1) age of participants 18+, (2) reported Montreal Neurological Institute (MNI) or Talairach whole-brain contrasts comparing BD and MDD subjects in rs-fMRI, (3) both compared samples currently depressed, (4) moderate severity of depression when enrolled (MADRS ≥ 20, HAMD ≥ 17), (5) at least ten subjects per group. Subsequently, excluding criteria were applied: (1) reported psychiatric or neurological comorbidity, (2) study included only bipolar type II participants, (3) seasonal depression, dysthymia, (4) psychosis, (5) only task-based MRI experiments reported, (6) independent component analysis (ICA) performed and only a specific component was examined, (7) only a priori region of interest (ROI) analysis or seed-based functional connectivity analysis performed. Potential disagreements between the evaluators were to be resolved by the third author (TN).

Exclusion criteria 6 and 7 were based on the recommendation from BrainMap (see http://brainmap.org/taxonomy/criteria.html) according to which studies that intentionally restrict the image acquisition and/or analysis to pre-selected ROI (or a specific ICA component) that is smaller than the whole brain should be excluded. Because the ALE algorithm assumes that activity in each voxel in the brain is equally likely to occur, foci from pre-selected ROIs may be reported as significant and result in an increase in false-positive rate. Studies, which were not based on BOLD (e.g. perfusion studies), were excluded as well.

From the 62 studies, 52 were excluded after full text read. The remaining ten rs-fMRI studies used amplitude of low frequency fluctuation (ALFF)40 and fractional ALFF (fALFF)42,43,44, regional homogeneity (ReHo)39,41,45,46,47, and degree centrality (DC)38. The process is summarized in the PRISMA flowchart shown in Fig. 2.

Figure 2
figure 2

PRISMA flowchart showing the process of identifying the articles included in the meta-analyses.

Study quality assessment

Individual study quality was assessed with a 10-point checklist (Supplementary material S1) based on previous meta-analytic studies107,108,109 for the quality of the sample sizes, diagnostic procedures, demographic and clinical parameters, the fMRI acquisition parameters, analysis method, and the quality of reported results. The quality of every included paper was reviewed by two authors independently (MP, ES). Ratings were compared and inconsistencies were discussed in order to obtain a consensus score.

Data extraction

From each included article MNI coordinates (e.g. foci) of reported peak values of significant between-group differences corrected for multiple comparisons were extracted (MP). No study used Talairach coordinates. Coordinates were assigned to two subgroups based on directionality to avoid that opposite findings across studies enhanced each other in the following analysis. The first group contained findings of increased DC, ReHo, ALFF, and fALFF in BD compared to MDD (BD > MDD). In the same manner, the second group included findings of increased resting-state measures in MDD compared to BD (MDD > BD). In each directionality subgroup, the lower n of the two samples (BD or MDD) from each study was assigned in the foci datasets, which were used in the meta-analytic calculation. The actual n of MDD and BD samples was used in calculations of heterogeneity (pooled analyses) of demographic and clinical variables across the studies. As each study may report findings in one or both directionalities and use different methods of analysis, findings in one directionality and/or analysis were referred to, according to the convention, as experiments. Hence, any study may contain coordinates from one or more experiments. Foci from different experiments conducted on the same samples, either within one study or in different studies, were grouped together. This was the case of two studies40,41 which compared the same MDD and BD sample, but used different analytical methods (e.g. ALFF and ReHo). Their findings for each directionality were grouped together in the main ALE meta analysis calculations and the samples were handled as one study in pooled analyses.

Scanning parameters of included rs-fMRI studies are available in Supplementary material S2.

Statistical analysis

GingerALE v3.02 (www.brainmap.org) software was used for the meta-analysis. After entry, the coordinates were masked using the conservative standard mask from GingerALE. ALE method treats each foci in a given experiment as a three-dimensional Gaussian probability distribution that represents the spatial uncertainty associated with the coordinate. The Full Width at Half Maximum (FWHM) of these Gaussian functions was determined automatically by GingerALE based on the number of subjects per experiment34. This accommodated the assumption that a larger sample size in an experiment provides more reliable approximations of the true activation effect and was therefore modeled by smaller Gaussian distributions and vice versa. Next, a model activation (MA) map was generated for each experiment by combining the probability distributions. To limit the cumulative effect of multiple foci close to each other within a given experiment, the non-additive approach was applied, which generated MA maps by taking the maximum probability across overlapping Gaussians37. Final ALE scores were computed on a voxel-by-voxel basis by taking the union across these MA maps. The resulting ALE image was thresholded on a cluster forming threshold of p < 0.0001 and corrected with a family-wise error correction with cluster-level inference threshold at p < 0.05 and 5000 permutations32.

The main ALE meta-analysis was conducted for both directionality subgroups (e.g. BD > MDD and MDD > BD). Supplementary ALE meta-analytic calculations were also conducted for each modality and directionality subgroup. Datasets with foci and ALE calculations are available at https://osf.io/e7y6c/ (Open Science Framework).

All analyses were calculated in MNI space. Anatomical labels were automatically assigned by GingerALE. Visualizations were created using Mango version 3.0.4 (http://ric.uthscsa.edu/mango/) and a high-resolution anatomical template with isotropic voxels in MNI space as distributed with GingerALE.