Introduction

Depressive disorder, a debilitating psychiatric disorder, is the leading cause of disability worldwide [1]. The World Health Organization (2019) estimates the global loss in productivity due to depressive disorders amounts to $1 trillion per year –a trajectory expected to rise [2]. Compositional variations or other dysregulations of the gut microbiome are increasingly believed to play key roles in the pathogenesis of mental illnesses [3,4,5,6,7,8]. The gut microbiome can potentially affect the brain through multiple pathways such as inducing metabolites of tryptophan [9, 10], stimulation of the vagal nerve [11, 12], inducing alterations of the hypothalamus-pituitary-adrenal axis [13], microbial production of human neurotransmitters [12], and stimulation of the immune system over a gut epithelium with higher permeability [14, 15] altogether summarized in the concept of the gut-brain axis [16], a bidirectional communication system between the gastrointestinal tract (GI) and the central nervous system (CNS) [17].

Several animal studies [11, 18,19,20,21,22] have consistently revealed that when the microbiome is transplanted from patients with depressive disorder to normal animals, depressive-like behaviors are induced, whereas transplantation from healthy hosts improves depressive symptoms. This demonstrates depression as a causal factor for microbiota change. Clinical studies [23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54] investigating the association between gut microbiome and depressive disorder have yielded mixed results. These inconsistent findings might be attributable to the use of psychotropic medication, country of study, age, sex and body mass index (BMI) of depressed patients, severity of depressive symptoms. Only two systematic reviews [55, 56] incorporated a meta-analysis of alpha diversity. Owing to the small number of studies included, Sanada et al. (2020) did not analyze the effects of confounders [55]. Nikolova et al. (2020) aimed at finding distinct or shared gut microbial alterations in psychiatric disorders, although they analyzed alpha and beta diversity and relative abundance of gut microbes in major depressive disorder (MDD), they did not analyze the effects of confounders in MDD [56].

The aim of our study was to compare gut microbiome between people with depressive disorder and healthy controls, using a systematic review and meta-analysis of available studies in the scientific literature. We will also explore sources of heterogeneity between studies using subgroup meta-analysis and meta-regression.

Methods

Search strategy and selection criteria

The protocol for this review was preregistered with PROSPERO (CRD42022315694). This systematic review and meta-analysis was conducted in adherence with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines [57]. Eight electronic databases including PubMed, Web of Science, Embase, Scopus, Ovid, Cochrane Library, ProQuest, and PsycINFO were searched on 7 March 2022 using the key terms “(depressive disorder OR depressive syndrome OR unipolar depression) AND (gastrointestinal microbiome OR gut microbiome OR gut microflora OR gut microbiota OR gastrointestinal flora OR gut flora OR gastrointestinal microbiota OR gastric microbiome OR intestinal microbiome)”. To describe the concept of implementation, Medical Subject Heading (MeSH) terms were used in PubMed and Emtree terms were used in Embase. We did not apply any restrictions on study design or publication data. To identify other potentially relevant studies, the reference list of reviews that were excluded from this study was manually searched [55, 58,59,60,61,62,63]. Search strategies were then re-run on 12 March 2023 for an update.

Titles and abstracts were independently screened by two authors (MG and JW) to identify possible articles for full-text retrieval. Inconsistencies in screening decisions were solved by consulting a third author (KZ). Firstly, titles and abstracts resulting from the search strategy were selected if they met the following inclusion criteria: (i) cross-sectional studies or reported baseline data from longitudinal studies of gut microbiota composition comparing patients who had depressive disorder with healthy controls, (ii) performed gut microbiota analysis and reported diversity or abundance measures, and (iii) published as full-text articles in peer-reviewed scientific journals. Correspondingly, the exclusion criteria were: (i) they examined the gut microbiota and anxiety/depression/bipolar disorder symptoms solely in another psychiatric disorder or disease, (ii) assessed the effect of an intervention without reporting relevant baseline measurements, (iii) no healthy controls, and (iv) published as reviews, case reports, conference abstracts, or letters. Next, the full text of relevant papers was then assessed for eligibility for inclusion using the same criteria.

Data extraction

Data were extracted and cross-checked by two independent authors (MG and JZW) using a predesigned template. We used Endnote to remove duplicate data. Information gathered for each study included the following items: name of the first author, publication year, the country in which the study was in, sample size, definition of disorder, age, sex (%female), body-mass index (BMI), smokers, alcohol, medication (such as antipsychotics, mood stabilizers, and antidepressants), Hamilton Depression Rating Scale (HDRS) scores, sequencing, diversity assessments, and methodological information. As primary outcomes of interest, we extracted community-level measures of gut microbiota composition (alpha and beta diversity) and taxa composition at phylum, order, family, and genus levels (relative abundance). Alpha diversity provides a summary statistic of the microbial community, whereby higher alpha diversity indicates a greater number of species (i.e., “richness”), with more even representation (i.e., “evenness”), and/or greater biodiversity according to the ancestral dissimilarity of species (i.e., “phylogenetic diversity”) [64, 65]. Beta diversity is an inter-individual measure that examines the similarity of communities relative to the other samples analyzed [66].

Data analysis

To evaluate the quality of the studies, two authors (MG and JW) independently used the Newcastle-Ottawa Scale (NOS) for the observational studies [67, 68]. The tool was developed to assess the quality of nonrandomized studies with its design, content, and ease of use directed to the task of incorporating the quality assessments in the interpretation of meta-analytic results. A ‘star system’ has been developed in which a study is judged on three broad perspectives: the selection of the study groups; the comparability of the groups; and the ascertainment of either the exposure or outcome of interest for case-control or cohort studies respectively. In cases of disagreement between the authors in some aspects of the evaluation, a third reviewer (KZ) was consulted to make the final decision.

The systematic review and meta-analysis consisted of three steps. First, we did the overall analysis for two between-group meta-analyses comparing alpha diversity in patients with depression, with those in healthy controls (cross-sectional studies). Additionally, we did a prespecified subgroup analysis of patients in depression by psychotropic medication and regional variations. We did subgroup analyses for these two specific moderators in view of their well-known effects on gut microbiota and their clinical implications [69,70,71,72,73,74]. We did meta-regression analyses to investigate possible moderators of alpha diversity. Restricted maximum likelihood random-effects meta-regressions [75, 76] of effect size were done with regional variations(east/west), use of psychotropic medication, mean age, sex, BMI, and severity of disease (as assessed by HDRS) as moderators. Studies were weighted such that the studies with the most precise parameters, quantified by the sample size and 95% confidence interval (CI), had more influence in the regression analyses.

Because studies used different measurement methods, we used standardized mean difference estimates of the differences in alpha diversity between patients and healthy controls as the effect size. A random-effects meta-analysis on Cohen’s d standardized mean difference (SMD) was performed applying the inverse-variance method, which allows population-level inferences and is more stringent than fixed-effect models [77, 78], and also calculated the corresponding 95% CIs. Random-effect modeling assumes a genuine diversity in the results of various studies and incorporates a between-study variance into the calculations [78]. The effect size was categorized as a low effect (cut-off level 0.2), meaning a small difference in alpha diversity between patients and controls, a moderate effect (cut-off level 0.5), and a large effect (cut-off level 0.8) [79]. The direction of the effect size was positive if patients with depressive disorder had increased alpha diversity, and negative if they had decreased alpha diversity compared with controls in the between-group meta-analyses. We assessed heterogeneity across studies using the Cochran Q test, a weighted sum of the squares of the deviations of individual study effect size estimates from the overall estimate, and considered a p value of less than 0.10 significant [80]. Inter-study heterogeneity was quantified using the DerSimonian–Laird estimator, reported with the I2 statistic and interpreted the percentage of total variation across several studies as a result of heterogeneity, and heterogeneity was considered moderate when I² is between 50% and 75%, and high when I² is greater than 75% [81]. We did sensitivity analyses to ascertain whether the results of our analyses were strongly influenced by any single study or a cluster of studies sharing some characteristic. The overall significance was recomputed after each study or group of studies with a common feature were deleted from the analysis. Publication bias was evaluated with funnel plots and Egger’s regression test [82, 83]. The level of significance for effect size estimates was set at p < 0.05.

Second, for beta diversity, we performed a qualitative synthesis in order to examine the differences of communities in depression compared with healthy control.

Third, for the relative abundance of microbial taxa, we performed a qualitative synthesis owing to the large number and limited overlap of findings. We summarized findings for each taxon reported in at least two studies and labeled those increased, decreased, or no difference. Because Firmicutes and Bacteroidetes are two major focuses in the human studies related to the gut microbiota and depression [84, 85], we conducted a quantitative synthesis. Medians and inter-quartile ranges were transformed to means (M) and standard deviations (SD) using a web-based tool [86]. Where necessary, numerical data were extracted from graphs using GetData Graph Digitizer (v.2.20) [87]. We used StataMP version 16.0 for all meta-analyses [78].

Results

Search results

The electronic search yielded 3380 papers. Additionally, 2 other records were identified as likely relevant to the review through other sources. The study selection process is presented in Fig. 1. Characteristics of included studies and quality of studies are described in the Tables 1 and S1 separately.

Fig. 1
figure 1

Study selection for meta-analysis of gut microbiota in depressive disorder.

Table 1 Characteristics of studies included in the meta-analysis.

Characteristics of included studies

The selected articles consisted of 44 case-control studies [19, 22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54, 88,89,90,91,92,93,94,95,96,97]. The total number of participants was 4883 (2091 [42.8%] in the depressive group and 2792 in healthy groups). The mean number of included patients in the studies was 48 (range 7–167), and the mean number of healthy controls was 63 (range 10 to 1071). Thirty-two studies (72.7%) were conducted in East Asia (China [19, 24, 27, 28, 30, 32,33,34,35,36, 38, 41, 42, 47,48,49,50,51,52,53,54, 89, 90, 93, 94, 96, 97], Japan [23], Korea [91], and Taiwan [29, 88, 95]), 12 (27.3%) in westernized populations (USA [37, 39, 43,44,45], Australia [26], Norway [40], Spain [25], Italy [31], Ireland [22], Russia [92], and Belgium [46]), grouped according to typical diet and lifestyle. Studies were similar in exclusion criteria, however, few attempted to minimize dietary changes or control dietary intake (7 [29, 31, 38, 47, 49, 91, 95] of 44 [15.9%]). The diagnosis of depression was assessed using the MINI, the DSM, and the ICD-10, while one study [96] assessed using only the HDRS, one [46] using GP-reported, one [91] using CES-D and one [49] using PHQ-9. In addition, 28 studies (63.6%) used the HAMD to examine the severity of depression symptoms [19, 22,23,24,25, 27, 28, 30, 33,34,35,36, 41, 42, 47, 48, 50,51,52,53,54, 89, 90, 92,93,94,95,96]. Only seven of all studies did not report BMI of participants [26, 37, 44, 45, 51, 93, 96]. Smoking status (17 [19, 22, 25,26,27,28,29, 31, 33, 35, 37, 42, 49, 50, 89, 93, 97] of 44 [38.6%]) and alcohol consumption (17 [22, 24,25,26,27,28, 30, 31, 39, 42, 45, 49, 51, 89, 93, 94, 97] of 44 [38.6%]) was reported. Use of psychiatric medication also varied substantially, with 18 [24, 27, 30, 32, 36, 38, 42, 48, 50, 51, 53, 54, 89, 90, 92,93,94, 96] of 44 studies (40.9%) conducted in medication-free or drug-naive groups, 21 [19, 22, 23, 25, 26, 28, 29, 31, 33,34,35, 37, 39, 41, 44,45,46,47, 52, 88, 97] of 44(47.7%) in groups undergoing treatment and the remainder not controlling this, resulting in anywhere between 10.9% and 100% of patients taking medication. Composition analysis (Table 1) varied widely, with 16 S ribosomal RNA sequencing being most common (35 [19, 22, 24,25,26, 29,30,31,32,33, 35,36,37,38,39,40, 42, 44,45,46, 48,49,50, 52,53,54, 88,89,90,91, 93,94,95,96,97] of 44 studies [79.5%]) followed by 1 study (2.3%) [23] using real-time quantitative polymerase chain reaction(RT-qPCR),1 study(2.3%) [28] using metaproteomics, 5 (11.4%) [34, 41, 43, 47, 92] using shotgun metagenomics and 1 study(2.8%) [27] using not only 16 S ribosomal RNA but also shotgun metagenomics. Methodology of stool processing (Table S2) showed that the far most common storage temperature was −80° (36 [19, 22, 25, 26, 28,29,30,31,32,33,34, 36,37,38,39,40,41,42,43,44,45, 48,49,50,51, 53, 54, 88,89,90, 92,93,94,95,96,97] of 44[81.8%]).

Alpha diversity

A total of 35 case-control studies in depressive disorder examined alpha diversity indices (Table S3) [19, 22, 24,25,26,27, 29, 30, 32,33,34, 36,37,38, 40,41,42, 45, 47,48,49,50, 52,53,54, 88,89,90,91,92,93,94,95,96,97]. Because four studies did not provide relevant data [26, 49, 50, 53], thirty-one studies were included in meta-analyses [19, 22, 24, 25, 27, 29, 30, 32,33,34, 36,37,38, 40,41,42, 45, 47, 48, 52, 54, 88,89,90,91,92,93,94,95,96,97]. Eleven indices were used to assess alpha diversity, including estimates of richness (observed species, Chao1, abundance coverage estimator [ACE]), biodiversity (Shannon, Simpson, inverse Simpson, Pielou’s, Fisher, Faith’s phylogenetic diversity), and 3 newly developed indices [41, 53, 97]. The most widely used were observed species index, Chao1 index, ACE index, Shannon index, Simpson index and phylogenetic diversity index. The observed species index, Chao1 index and ACE index reflected the abundance of the community. The Shannon index, Simpson index, and phylogenetic diversity index reflected the diversity of the community.

Regarding richness, 11 studies provided data on observed species in patients (n = 650) vs controls (n = 1421). The pooled estimate showed no significant difference between groups (SMD = −0.08; 95% CI, −0.24 to 0. 08; P = 0.337) and no significant heterogeneity (Fig. 2A) [19, 22, 27, 36, 37, 40, 45, 54, 91, 94, 96]. Chao1 data were provided by 19 studies (1045 patients and 930 controls). There was a no significant difference between groups (SMD = 0.10; 95% CI, −0.30 to 0.50; P = 0.608), with high heterogeneity (I2 = 93.7%) (Fig. 2B) [22, 24, 27, 30, 32, 33, 36, 41, 42, 45, 47, 48, 52, 89, 90, 93, 94, 96, 97]. Eleven studies reported data on ACE in patients (n = 604) vs controls (n = 524).There was a no significant difference between groups (SMD = 0.04; 95% CI, −0.51 to 0.58; P = 0.894), with high heterogeneity (I2 = 93.5%) (Fig. 2C) [27, 30, 32, 33, 42, 45, 52, 90, 93, 96, 97].

Fig. 2: Forest plots of alpha diversity in the gut microbiota of patients with depressive disorder compared with healthy controls.
figure 2

A Observed; B Chao1; C ACE; D Shannon index; E Simpson index; F Phylogenetic diversity.

Regarding diversity, random-effects between-group meta-analysis showed that there was no significant difference in patients with depressive disorder compared with healthy controls on Shannon index whose data was provided by 29 studies (1506 patients, 2293 controls; SMD = −0.22; 95% CI, −0.22 to 0.21; P = 0.955) with high heterogeneity (I2 = 87.4%) (Fig. 2D) [19, 22, 24, 25, 27, 29, 30, 32,33,34, 36,37,38, 41, 42, 45, 47, 48, 52, 54, 88,89,90,91,92,93,94,95, 97]. Simpson index data were provided by 14 studies (770 patients, 735 controls). There was no significant difference between groups (SMD = −0.22; 95% CI, −0.54 to 0.10; P = 0.184), with high heterogeneity (I2 = 88.1%; Fig. 2E) [19, 24, 27, 30, 33, 36, 40,41,42, 47, 52, 90, 93, 94]. Finally, 11 studies provided phylogenetic diversity data in patients (n = 634) vs controls (n = 1431). The pooled estimate showed no significant difference between groups (SMD = −0.11; 95% CI, −0.25 to 0.03; P = 0.138) and no significant heterogeneity (Fig. 2F) [19, 22, 24, 32, 36, 37, 45, 54, 91, 94, 95].

In subgroup analyses of patients with depression by regional variations(east/west), Chao1 was different in the depression group. Patients who were in the West had fewer number of species (SMD = −0.42; 95% CI, −0.74 to −0.10; P = 0.011; Fig. 3A). Subgroup meta-analyses using regional variations(east/west) and psychotropic medication as predictor variables were not significantly different as assessed by Shannon index (Fig. 3C, D).

Fig. 3: Subgroup meta-analysis of Chao 1 index and Shannon index.
figure 3

A Chao1; B Chao1; C Shannon; D Shannon.

In our investigation of other sources of heterogeneity using meta-regression analyses, meta-regression models, severity of depressive symptoms as assessed by HDRS, was not related to the magnitude of the effect size, indicating no association between severity of symptoms and alpha diversity (Chao1 and Shannon) (Fig. S1A, D). Additionally, we found no relation between age, sex, and BMI in depression and alpha diversity (Chao1 and Shannon; Table S4).

Beta diversity

Of the 44 case-control studies, 34 studies analyzed beta diversity, using a variety of measures (Table S5) [19, 22, 24,25,26,27, 29, 30, 32,33,34,35,36,37, 39, 41, 42, 44, 45, 47, 48, 50,51,52, 54, 88,89,90,91,92, 94,95,96,97]. Consistent nonsignificant differences were reported by 12 studies, and a further 3 reported conflicting results between the measures used. Nineteen studies found significant differences in beta diversity between patients with depressive disorder and controls (Table S5). Mason et al. [39] found no difference between participants with depression and controls when stratified by diagnosis, but hierarchical clustering of beta diversity identified two participant groups associated with anhedonia scores derived from self-report questionnaires (weighted UniFrac). These findings suggest there is reliable evidence for differences in the phylogenetic relationship in depressive disorder compared with controls, however, method of measurement and method of patient classification (symptom vs diagnosis based) may affect findings.

Summary of representative taxa in the observational trials

Of the 44 studies which analyzed the gut microbiota in depressive disorder, 19 studies(43.2%) presented taxa specific results based on a Linear discriminant analysis effect size (LEfSe) analysis and all reported findings with a Linear discriminant analysis (LDA) scoreå 2 or ≤2 [24, 27, 32,33,34, 36, 37, 42, 47, 48, 50,51,52, 54, 90, 94,95,96,97]. Most studies identified significant differences between patients and controls at phylum, family, or genus levels. Owing to the significant likelihood of false [62], we summarized findings for each taxon reported in at least 2 studies and labeled those increased, decreased, or not changed (Figs. 4 and S2).

Fig. 4: Changes in relative abundance of microbial taxa reported by at least 2 studies.
figure 4

A Level Phylum; B Level Family; C Level Genus; D Level Species.

At the phylum level, 18 studies provided data on Firmicutes in patients (n = 665) vs controls (n = 609) [19, 25, 29, 31,32,33, 35, 37, 38, 41, 42, 45, 48, 51, 54, 88, 92, 97]. The pooled estimate showed no significant difference between groups (SMD = −0.51; 95% CI, −1.15 to 0.14; P = 0.123), with high heterogeneity (I2 = 96.0%; Fig. 5A). Bacteroidetes data were provided by 18 studies (760 patients and 753 controls) [19, 25, 29, 31, 33, 35, 37, 38, 41, 42, 45, 48, 51, 52, 54, 88, 92, 97]. There was a no significant difference between groups (SMD = 0.02; 95% CI, −0.58 to 0.62; P = 0.952; Fig. 5B). Results of our subgroup meta-analyses for psychotropic medication showed that Firmicutes remained significantly different in patients with depressive disorder who were medication free (SMD = −1.54; 95% CI, −2.36 to −0.72; P = 0.033; Fig. 6B), and Bacteroidetes also remained significantly different in patients with depressive disorder who were medication free (SMD = 0.90; 95% CI, 0.07 to 1.72; P < 0.001; Fig. 6D). Meta-regression models, severity of depressive symptoms as assessed by HDRS, was not related to the magnitude of the effect size, indicating no association between severity of symptoms and phylum level (Firmicutes and Bacteroidetes) (Fig. S1C, D). Additionally, we found no relation between age, sex, and BMI in depression and phylum level (Firmicutes and Bacteroidetes; Table S4). An overview of the family level is provided in Fig. 4A.

Fig. 5: Forest plots of phylum level in the gut microbiota of patients with depressive disorder compared with healthy controls.
figure 5

A Firmicutes; B Bacteroidetes.

Fig. 6: Subgroup meta-analysis of Firmicutes and Bacteroidetes.
figure 6

A Firmicutes; B Firmicutes; C Bacteroidetes; D Bacteroidetes.

At the family level, over two studies consistently found that the abundance of Actinomycetaceae, Enterococcaceae, Leuconostocaceae, Porphyromonadaceae, and Streptococcaceae were higher in depression relative to controls. Prevotellaceae was lower in depressive disorders reported by nine studies [22, 24, 28, 29, 33, 48, 50, 51, 97] and only one [31] reported higher abundance. An overview of the family level is provided in Fig. 4B.

At the genus level, at least 3 studies reported that depleted levels of Butyricicoccus, Coprococcus, Eubacterium_ventriosum_group, Faecalibacterium, Fusicatenibacter, Romboutsia, Subdoligranulum and enriched Eggerthella, Enterococcus, Escherichia, Flavonifractor, Holdemania, Lachnoclostridium, Paraprevotella, Rothia, and Streptococcus were consistently shared in depressive disorder. Megamonas was lower in depression in four studies [19, 27, 29, 38], although the opposite was observed in one study [33]. A higher abundance of Oscillibacter and Parabacteroides was reported in depression in all the reported studies, except one study [38] reported the opposite result. Finally, lower levels of Odoribacter were observed in three studies in depression, [38, 41, 42] although the opposite was observed in one study [36]. An overview of genus level is provided in Fig. 4C.

The vast majority of included studies were based on 16S rRNA gene sequencing, and consequently, they do not provide enough taxonomic resolution to report results at the species level. Nevertheless, at least 3 case-control studies independently reported several overlapping findings, including a higher relative abundance of Bacteroides_caccae, Bacteroides_fragilis, and a lower relative abundance of Faecalibacterium_prausnitzii in depressive disorders relative to controls. An overview of the species level is provided in Fig. 4D.

We explored the association of study region (east/west) with microbial alterations. Owing to the imbalanced availability of studies by region (70.5% included studies were largely investigated in the east), this analysis should be considered preliminary. Clustering according to region identified several taxa that were altered only in over two studies from Eastern countries: Adlercreutzia, Alloprevotella, Barnesiella, Clostridium_sensu_stricto, Enterobacter, Escherichia-Shigella, Fusobacterium, Haemophilus, Megamonas, Megasphaera, Odoribacter, Olsenella, Parasutterella, Parvimonas, Prevotella_2, Ruminococcus_torques_group, Sutterella, and Veillonella were not consistent; Rothia and Sphaerochaeta were increased; Romboutsia and Eubacterium_ventriosum_group were decreased. These differences were driven entirely by studies from China, highlighting the need to distinguish the Chinese microbiome from other East Asian nations as more evidence becomes available.

We also investigated effects of psychiatric medication on microbiota composition. We found that increased in the family Actinomycetaceae, Enterococcaceae, Streptococcaceae and the genera Holdemania, Rothia, and Streptococcus were only reported in medicated groups, while Dialister was decreased in medicated and increased in medication-free groups. These results indicated psychiatric medication can affect microbiota composition.

Results from the sensitivity analyses, and publication bias are shown in the appendix (Figs. S3 and S4).

Discussion

We assessed gut microbiota alterations of depressive disorder with the aim of evaluating the reproducibility and specificity of potential gut microbial biomarkers. The systematic review revealed substantial methodological differences between the included studies both regarding demography and storage of fecal samples with differences in analyses methods and reporting of study findings. It reports that there were no significant differences in patients with depressive disorder on alpha diversity indices, Firmicutes and Bacteroidetes compared with healthy controls. In subgroup analyses with regional variations(east/west) as a predictor, Patients who were in the West had a lower Chao1 level (moderate effect size). Subgroup meta-analysis showed that Firmicutes level was decreased in patients with depressive disorder who were medication free (large effect size), but Bacteroidetes level was increased (large effect size). Subgroup meta-analyses using regional variations(east/west) and psychotropic medication as predictor variables were not significantly different as assessed by Shannon index. In the meta-regression analysis, six variables—regional variations (east/west), use of psychotropic medication, age, sex, BMI, and HDRS—cannot explained the 100% heterogeneity of the studies assessing by Chao1, Shannon index, Firmicutes and Bacteroidetes; which would suggest the presence of unmeasured moderators. All studies reviewed found significant differences in taxa between depression and control groups. However, there was minimal consensus regarding either microbial diversity or relative abundance or directionality of differences in taxa associated with depressive disorders. The most consistent changes in the microbiota were a higher relative abundance of Eggerthella, Enterococcus, Escherichia, Flavonifractor, Holdemania, Lachnoclostridium, Paraprevotella, Rothia, Streptococcus and lower for Butyricicoccus, Coprococcus, Faecalibacterium, Eubacterium_ventriosum_group, Fusicatenibacter, Romboutsia, Subdoligranulum at genus level.

Lower diversity in patients relative to controls has been reported in some diseases and mental health disorders [98,99,100,101,102], however, our quantitative synthesis of included studies revealed no significant differences in the SMDs of the observed species, Chao1, ACE, phylogenetic diversity, Shannon and Simpson Index between patients with depressive disorder and healthy controls. The use of alpha diversity indices in biomedical research stems from the assumption that “higher diversity is somehow more meritorious ecologically”, and that diversity of species provides a proxy for microbial function and stability that is assumed to be favorable for the host [64]. Nevertheless, the present review reveals that this conclusion is, at present, unfounded in depression. Regarding beta diversity (between samples), patients with depressive disorder consistently clustered differently from controls. However, we could not conduct a meta-analysis due to insufficient data.it is yet unknown whether depressive disorder cluster differently from other psychiatric disorders, thus questioning the suitability of diversity measures as biomarkers. Moreover, existing multivariate analyses which compare beta diversity between groups often involve dichotomization based on the absence or presence of a condition (PCoA/PERMANOVA). Research in depressive disorder thus encounters several methodological challenges not faced by conditions with defined biomarkers, including the disparate methods by which clinical groups are defined (e.g., self-reported symptom levels, psychiatrist diagnosis, clinical interviews). Accordingly, incorporation of bias due to nosology requires consideration in future research.

Notwithstanding Firmicutes and Bacteroidetes are observed to alter in the human studies related to the gut microbiota and depression [84], however, our quantitative synthesis of included studies revealed no significant differences in the SMDs of the Firmicutes and Bacteroidetes between patients with depressive disorder and healthy controls.

Although diversity and phylum level findings were nonsignificant differences, specific bacterial taxa were implicated in studies that compared the gut microbiota of depression groups relative to controls. Among the most consistent findings was a lower abundance of Butyricicoccus, Coprococcus, Faecalibacterium, Fusicatenibacter, Eubacterium_ventriosum_group, Romboutsia, Subdoligranulum, as well as a higher abundance of Eggerthella, Enterococcus, Escherichia, Flavonifractor, Holdemania, Lachnoclostridium, Paraprevotella, Rothia, and Streptococcus in participants with depressive disorder relative to controls. Several mechanisms by which these taxa may be associated with depression, focusing on increased proinflammatory communication and anti-inflammatory communication via the gut-brain axis.

Several taxa reported to have a higher relative abundance in clinical depression were associated with gastrointestinal inflammation (i.e., Enterococcus, Eggerthella) [103,104,105]. Inflammation has been widely suggested as a contributor to the pathogenesis of depressive disorders [106, 107]. Consistently, growing shreds of evidence of continuous low-level immune-inflammatory reaction also cannot be ignored and the source of this immune inflammation reaction is probably related to the disorder of gut microbiota [19, 22, 108, 109]. The potential for microbiota-mediated inflammation in depression is not only indicated by the increase in inflammation-associated microbial members but may be further exacerbated by a loss of species that secrete anti-inflammatory metabolic products. Our review revealed a reduction of Faecalibacterium, Coprococcus, and Butyricicoccus which have been demonstrated to secret anti-inflammatory the short-chain fatty acid (SCFAs) particularly butyrate in depression groups relative to controls [110,111,112], and Sutterella, Megamonas which can produce acetate and propionate [113]. For Oscillibacter, which was observed increase, the type strain of this genus has valeric acid as its main metabolic end product. [114] Gut microbiota can also regulate brain function by influencing tryptophan metabolism [115] Some bacteria, such as Enterococcus, Lactobacillu and Oscillibacter, which encode Trp synthase genes, are found to increase mainly in included studies. Trp and 5-HTP (precursor of 5-HT) can pass through the blood-brain barrier (BBB) and become the precursors of 5-HT in the brain.

Many of the included studies were conducted in Asia regions. Geographic/ethnic dietary differences may be expected to affect the gut microbiome directly [74], differences in microbial composition vary considerably by geographical location [69]. Our study revealed that patients who were West had a lower Chao1 level, but the result was not observed in Shannon index, indicating regional variations(east/west) may only influence number of species of intestinal flora, not affect the evenness of gut microbiota. Given the small number of studies in the subgroup meta-analysis, this limits the interpretation and generalization of findings, therefore, additional studies are needed for a definitive conclusion on this issue. Consistently, a study of regionally heterogeneous participants did find a signal for mental illness status that was reproducible among subsets by region, although the mental illness status was determined by self-report and covered several conditions in addition to depression [69]. Almost all studies did not consider the effects of diet, some of the selected studies collected the diet information by interview [38], questionnaire [29, 47, 49, 91, 95], and dietary records [31]. It is well known that diet would potentially affect the distribution of microbiota and their function [116]. Consumption of a high-fat and animal protein diet has also been associated with elevated Actinobacteria [117], and the majority of studies that reported differences in Actinobacteria did not control for diet [19, 27, 28, 33,34,35, 41, 45, 51, 54, 89, 90]. Further investigations which adequately model dietary intake are required to disentangle whether associations are driven by dietary intake or independently associated with depression.

There was evidence in the literature that psychotropic medications may impact the gut microbiome, for example, atypical antipsychotics are associated with altered gut microbiota in rodents [70,71,72]. In a study of bipolar adults, significant changes in the abundance of three genera were identified between those taking vs. not psychotropic medications [73]. In our study, Subgroup meta-analysis showed Firmicutes level was decreased in patients with depressive disorder who were medication free, but Bacteroidetes level was increased, indicating psychotropic medications may influence gut microbiota. Accordingly, incorporation of bias due to psychotropic medications requires consideration in future research.

The selected studies had a wide range of ages. In general, there are differences in the distribution of microbiota according to age [118, 119]. For example, Firmicutes is the dominant taxa during the neonatal period, but Actinobacteria and Proteobacteria are about to increase in three to six months [120]. While in adults, Vemuri et al. [121] reported that Bacteroidetes and Firmicutes were the dominant taxa. Meanwhile, compared to younger individuals, the abundance of Bacteroidetes is significantly higher in frailer older individuals [122]. Similarly, Chen et al. (2020) identified 6 and 25 differentially abundant bacterial taxa responsible for the differences between MDD patients (young and middle-aged, respectively) and their respective HCs [123]. However, we found no relation between age in depression and Chao1, Shannon index, Firmicutes and Bacteroidetes. Given the small number of studies in the meta-regression, this limits the interpretation and generalization of findings, therefore, additional studies are needed for a definitive conclusion on this issue.

There is clear sex difference in the prevalence of depressive disorder, both being more common in females than males [1]. Sex differences in gut microbiota composition have also been suggested to underlie susceptibility to gut-microbiota-mediated conditions in females [124]. In the few studies that examined males and females separately, significant differences were observed, sometimes with effects in opposite directions (e.g., Lachnospiraceae, Coriobacteriaceae, and Erysipelotrichaceae incertae sedis) [125]. However, we found no relation between sex in depression and Chao1, Shannon index, Firmicutes, and Bacteroidetes.

In the meta-regressions we found no association between depressive symptoms, BMI in depression and Chao1,Shannon index, Firmicutes, and Bacteroidetes.

Based on our narrative synthesis heterogeneity of studies was visible and studies reporting no significant results were prominent, yet tend to not report sufficient data for inclusion in meta-regressions, resulting in a bias in the meta-regressions on significant effects. These limitations may have resulted in false-negative result in our meta- regressions, and results should thus be interpreted with caution. Therefore, additional studies are needed for a definitive conclusion on the sources of heterogeneity.

The use of metaproteomics, shotgun metagenomic, and RT-qPCR affects comparability with the other studies, which used 16 S RNA quantification. Furthermore, in contrast to most of the studies, three studies limited their search of the gut microbiome a priori to specific taxa: Firmicutes and Bacteroidetes [28, 32] or Bifidobacterium and Lactobacillus [23]. Even though most studies used high throughput sequencing of 16 S rRNA, the analysis methods involved different variable regions, different pipelines, different databases, and different cut-offs, which may each influence results to varying degrees. Moreover, analytic methods differed widely among the studies. Statistical methodology for microbiome analysis has not been standardized across the field, and many approaches have been noted to be prone to high false discovery rates [126].

An important strength of our systematic review was the search strategy, since we have used a range of databases and made an exhaustive effort to acquire data. Our study has some inherent limitations. First, the meta-regressions might have failed to achieve statistical significance because of a lack of power in these specific analyses, thus giving a false-negative result. Second, our meta-analyses on alpha diversity indices and phylum level in patients with depressive disorder compared with healthy controls provided us with pooled results originating from cross-sectional studies, and we therefore cannot draw any conclusions on causality. Third, waist circumference is more accurate than BMI for assessment of visceral adiposity. Since most studies included in our analysis did not provide data on waist circumference, we used BMI as a surrogate for visceral adiposity. Finally, any meta-analysis is dependent on the quality of the analyzed studies, and our results need to be verified by studies specifically designed to test the points we raised. Finally, poor quality studies were included and included studies have publication bias. The limitations should be considered in future syntheses. Many other approaches to the complex field of the gut and brain than case-control studies can be carried out and it is yet far too early to conclude that the gut microbiota is different, and if it is the matter of causality remains to be addressed. Studies with more participants (especially of psychosis and major depressive disorder) are needed, and these studies should focus not only on the gut microbial composition but also on the function, carefully taking confounders into account, thereby making it possible to analyze data in a wider perspective than to date.

Conclusion

This systematic review and meta-analysis found that psychotropic medication and dietary habit can influence microbiota. There is reliable evidence for differences in the phylogenetic relationship in depressive disorder compared with controls; however, method of measurement and method of patient classification (symptom vs diagnosis based) may affect findings. Depressive disorder is characterized by a reduction of anti-inflammatory butyrate-producing bacteria, while pro-inflammatory genera are enriched. Future research should assess confounders and examine microorganism function to prevent unmerited claims of disorder specificity of gut microbial biomarkers.